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WO 96/23928 PCT/US96/Q0977 

METHOD AND COMPOSITIONS FOR TREATING CELLULOSE CONTAINING 
FABRICS USING TRUNCATED CELLULASE ENZYME COMPOSITIONS 

5 Aj. Field of the Invention 

The present invention is directed to improved methods for 
treating cotton-containing fabrics and non-cotton containing 
cellulose fabrics with cellulase as well as to the fabrics 
produced from these methods. In particular, the improved 
10 methods of the present invention are directed to contacting 
cotton-containing fabrics and non-cotton containing fabrics 
with an aqueous solution containing a cellulase composition 
which comprises one or more truncated cellulase enzymes. 

State of the Art 

15 During or shortly after their manufacture, cotton- 

containing fabrics can be treated with cellulase in order to 
impart desirable properties to the fabric. For example, in 
the textile industry, cellulase has been used to improve the 
feel and/or appearance of cotton-containing fabrics, to remove 

20 surface fibers from cotton-containing knits, for imparting a 
stone washed appearance to cotton-containing denims and the 
like. 

In particular, Japanese Patent Application Nos. 58-36217 
and 58-54032 as well as Ohishi et al., "Reformation of Cotton 

25 Fabric by Cellulase" and JTN December 1988 journal article 
"What's New — Weight Loss Treatment to Soften the Touch of 
Cotton Fabric" each disclose that treatment of cotton- 
containing fabrics with cellulase results in an improved feel 
for the fabric. It is generally believed that this cellulase 

30 treatment removes cotton fuzzing and/or surface fibers which 
reduces the weight of the fabric. The combination of these 
effects imparts improved feel to the fabric. 

Additionally, it was heretofore known in the art to treat 
cotton-containing knitted fabrics with a cellulase solution 

3 5 under agitation and cascading conditions, for example, by use 
of a jet, for the purpose of removing broken fibers and 
threads common to these knitted fabrics. 

Clothing made from cellulose fabric, such as cotton 
denim, is stiff in texture due to the presence of sizing 

40 compositions used to ease manufacturing, handling and 
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assembling of clothing items and typically, has a fresh dark 
dyed appearance. One desirable characteristic of indigo-dyed 
denim cloth is the alteration of dyed threads with white 
threads, which gives denim a white on blue appearance. 

After a period of extended wear and laundering, the 
clothing items, particularly denim, can develop in the 
clothing panels and on seams, localized areas of variation in 
the form of a lightening, in the depth or density of color. 
In addition, a general fading of the clothes, some pucker in 
seams and some wrinkling in the fabric panels can often 
appear. Additionally, after laundering, sizing is 
substantially removed from the fabric resulting in a softer 
feel. In recent years such a distressed or "stonewashed" 
look, particularly in denim clothing, has become very 
desirable to a substantial proportion of the public. 

Previous methods for producing the distressed look 
included stonewashing of a clothing item or items in a large 
tub with pumice stones having a particle size of about 1 to 10 
inches and with smaller pumice particles generated by the 
abrasive nature of the process. Typically the clothing item 
is tumbled with the pumice while wet for a sufficient period 
such that the pumice abrades the fabric to produce in the 
fabric panels, localized abraded areas of lighter color and 
similar lightened areas in the seams. Additionally the pumice 
softens the fabric and produces a fuzzy surface similar to 
that produced by the extended wear and laundering of the 
fabric. This method produced the desired white on blue 
contrast described above. 

The use of the pumice stones has several disadvantages, 
including overload damage to the machine motors, mechanical 
damage to transport mechanisms and washing drums, 
environmental waste problems from the grit produced and high 
labor costs associated with the manual removal of the stones 
from the pockets of the garments. 

In view of the problems associated with pumice stones in 
stonewashing, cellulase solutions are used as a replacement 
for the pumice stones under agitating and cascading 
conditions, i.e., in a rotary drum washing machine, to impart 



WO 96/23928 



PCT/US96/00977 



a "stonewashed" appearance to the denim (U.S. Patent No. 
4,832,864) . 

Cellulases are enzymes which hydrolyze cellulose (0-1,4- 
D-glucan linkages) and produce as primary products glucose, 
cellobiose, cellooligosaccharides, and the like. Cellulases 
are produced by a number of microorganisms and comprise 
several different enzyme classifications including those 
identified as exo-cellobiohydrolases (CBH) , endoglucanases 
(EG) and 0-glucosidases (BG) (Schulein, M, 1988 Methods in 
Enzymology 160: 235-242). 

The enzymes within these classifications can be separated 
into individual components. For example, the cellulase 
produced by the filamentous fungus, Trichoderma 
lonqibrachlatuffl , hereafter T. lomHhrarh^Mfl), consists of at 
least two CBH components, i.e., CBHI and CBHII, and at least 
four EG components, i.e., EG1, ECU, EGIII and EGV (Saloheimo, 
A. et al 1993 in Proceedings of the second TRICEL symposium on 
TrichPderma , reegej Cellulases and Other Hydrolases, Espoo, 
Finland, ed by P. Suominen & T. Reinikainen. Foundation for 
Biotechnical and Industrial Fermentation Research 8: 139-146) 
components, and at least one 0-glucosidase. The genes 
encoding these components are namely cbhi . cbh2 . eali . eal2 . 
&3l2, and ea!5 respectively. 

The complete cellulase system comprising CBH, EG and BG 
components synergistically act to convert crystalline 
cellulose to glucose. The two exo-cellobiohyrolases and the 
four presently known endoglucanases act together to hydrolyze 
cellulose to small cello-oligosaccharides. The 
oligosaccharides (mainly cellobioses) are subsequently 
hydrolyzed to glucose by a major 0-glucosidase (with possible 
additional hydrolysis from minor 0-glucosidase components) . 

A problem with the use of complete cellulase compositions 
froin Trichoderma sp. microorganisms and other fungal sources 
for stonewashing dyed denim is the incomplete removal of 
colorant caused by redeposition or backstaining of some of the 
dye back onto the cloth during the stonewashing process, in 
the case of denim fabric, this causes recoloration of the blue 
threads and blue coloration of the white threads, resulting in 
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less contrast between the blue and white threads and abrasion 
points (i.e., a blue on blue look rather than the preferred 
white on blue). See, American Dvestuff Reporter, Sept. 1990, 
pp. 24-28. This redeposition is objectionable to some users. 

Trichoderma cellulases, even though they result in 
backstaining are preferred because of their higher activity on 
denim material. In addition, cellulases with a higher degree 
of purity may be beneficial in the present invention. High 
specific activity or a high level of purity results in a 
higher degree of abrasion in a significantly shorter 
processing time and therefor, is preferable to the denim 
processors. 

Attempts to reduce the amount of redeposition of dye 
included the addition of extra chemicals or enzymes, such as 
surfactants, proteases or other agents, into the cellulase 
wash to help disperse the loosened dye. In addition, 
processors have used less active whole cellulase, along with 
extra washings. However, this results in additional chemical 
costs and longer processing times. Another method includes 
the use of a mild bleach agent or stain removing agent in the 
process. This method affects the garment's final shade and 
increases the processing time. Finally the use of enzymes and 
stones together leave the processor with all the problems 
caused by the use of the stones alone. Accordingly, it would 
be desirable to find a method to prevent redeposition of 
colorant during stonewashing with cellulases. 

Protein analysis of the cellobiohydrolases (CBHI and 
CBHII) and major endoglucanases (EGI and EGII) of 1^ 
lonaibrachiatum has shown that a bifunctional organization 
exists in the form of a catalytic core domain and a smaller 
cellulose binding domain separated by a linker or flexible 
hinge stretch of amino acids rich in proline and hydroxyamino 
acids. Genes for the two cellobiohydrolases, CBHI and CBHII 
(Shoemaker, S et al 1983 Bio/Technology 1, 691-696, Teeri, T 
et al 1983, Bio/Technology 1, 696-699 and Teeri, T. et al, 
1987, Gene 51, 43-52) and two major endoglucansases, EGI and 
EGII (Penttila, M. et al 1986, Gene 45, 253-263, Van Arsdell, 
J.N/ et al 1987 Bio/Technology 5, 60-64 and Saloheimo, M. et 
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al 1988, Gene 63, 11-21) has been isolated from £1. 

iPngifrraghAfitmn and the protein domain structure has been 
confirmed. 

A similar bifunctional organization of cellulase enzymes 
is found in bacterial cellulases. The cellulose binding 
domain (CBD) and catalytic core of Cellulomon^ 
endoglucanase A (C. fimi cen A) has been studied extensively 
(Ong E. et al 1989, Trends Biotechnol. 7:239-243, Pilz et al 
1990, Biochem J • 271:277-280 and Warren et al 1987, Proteins 
1:335-341). Gene fragments encoding the CBD and the CBD with 
the linker have been cloned, expressed in E. coli and shown to 
possess novel activities on cellulose fibers (Gilkes, N.R. et 
al 1991, Microbiol. Rev. 55:305-315 and Din, N et al 1991, 
Bio/Technology 9:1096-1099). For example, isolated CBD from 
Ct fiwi. Cen A genetically expressed in E. coli disrupts the 
structure of cellulose fibers and releases small particles but 
has no detectable hydrolytic activity. CBD further possess a 
wide application in protein purification and enzyme 
immobilization. On the other hand, the catalytic domain of c. 
timi Cen A isolated from protease cleaved cellulase does not 
disrupt the fibril structure of cellulose and instead smooths 
the surface of the fiber. 

Trichoderma lonqibrachiatum cbht core domains have been 
separated proteolytically and purified but only milligram 
quantities are isolated by this biochemical procedure (Offord 
D. , et al 1991, Applied Biochem. and Biotech. 28/29:377-386). 
Similar studies were done in an analysis of the core and 
binding domains of CBHI, CBH11, EGI and EGII isolated from X* 
Ignq^rapfriatVW after biochemical proteolysis, however, only 
enough protein was recovered for structural and functional 
analysis (Tomme, P et al, 1988, Eur. J. Biochem 170:575-581 and 
Ajo, S, 1991 FEBS 291:45-49). Accordingly, the prior art has 
failed to recognize the improvements possible in textile 
processing when using cellulase core domain or cellulose 
binding domain regions. 



SUMMARY O F THE INVENTION 

It is an object of the present invention to provide a 
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method of treating cellulose containing fabrics which- results 
in reduced redeposition of dye at an equivalent level of 
abrasion of the fabric treated over prior art methods. 
It is a further object of the present invention to 
5 provide a detergent composition having the properties of 

reduced redeposition of dye at an equivalent level of abrasion 
over prior art detergent compositions. 

It is a further object of the present invention to 
provide a stonewashing composition having the properties of 
10 reduced redeposition of dye at an equivalent level of abrasion 
over prior art stonewashing compositions. 

According to the present invention a method is provided 
for treating cellulose containing fabrics with cellulase 
comprising the steps of: (a) contacting said cellulose 
15 containing fabric with a treating composition comprising an 
effective amount of truncated cellulase enzyme, derivative 
thereof or naturally occurring cellulase containing no 
cellulase binding domain; and (b) incubating said cellulose 
containing fabric in contact with said truncated c-allulase 
20 enzymes, derivative thereof or naturally occurrin: cellulase 
containing no cellulase binding domain for a time and under 
temperature effective to treat said fabric. Als. , a 
composition is provided for treating a cellulose containing 
fabric comprising a truncated cellulase. In a preferred 
25 embodiment the method of treating comprises laundering or 
stonewashing. Also, preferably, the truncated enzyme 
comprises a truncated cellulase core. Most preferably the 
truncated cellulase core comprises EG I core, EGII core, CBHI 
core, or CBHII core. Further preferably, the cellulase is 
30 present in a concentration of from about 0.1 to 1,000 ppm, 
more preferably from about 0.5 to about 250 ppm. 

According to a preferred embodiment of the present 
invention, a detergent or stonewashing composition is provided 
comprising truncated cellulase enzyme. Because of the 
35 surprising reduced redeposition activity of the truncated 
cellulase enzyme compositions according to the present 
invention, cellulose containing fabrics would be enhanced to a 
surprising extent upon cleaning or stonewashing with a 
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suitable composition comprising truncated enzyme. 

An advantage of the present invention is that a cellulase 
composition comprising truncated cellulase core enzymes, alone 
or in combination with other truncated cellulase core or non- 
truncated cellulases, is provided which confers desirable 
qualities to cellulose containing fabrics. 

BRIEF DB8CRIPTTOK OP THE PBlWTMna 

Figure 1(a) -1(b) depicts the genomic DNA and amino acid 
sequence of CBHI derived from Trichoderma lontHhracMafrum . 
The signal sequence begins at base pair 210 and ends at base 
pair 260 (Seq ID No. 25) . The catalytic core domain begins at 
base pair 261 through base pair 671 of the first exon, base 
pair 739 through base pair 1434 of the second exon, and base 
pair 1498 through base pair 713 of the third exon (Seq ID No. 
9) . The linker sequence begins at base pair 714 and ends at 
base pair 1785 (Seq ID No. 17). The cellulose binding domain 
begins at base pair 1786 and ends at base pair 1888 (Seq ID 
No. l). Seq ID Nos. 26, 10, 18 and 2 represent the amino acid 
sequence of the CBHI signal sequence, catalytic core domain, 
linker region and binding domain, respectively. 

Figure 2 (a) -2(b) depicts the genomic DNA and amino acid 
sequence of CBHI I derived from Trichoderm a lonaibmnM » t 1i r 
The signal sequence begins at base pair 614 and ends at base 
pair 685 (Seq ID No. 27) . The cellulose binding domain begins 
at base pair 686 through base pair 707 of exon one, and base 
pair 755 through base pair 851 of exon two (Seq ID No. 3). 
The linker sequence begins at base pair 852 and ends at base 
pair 980 (Seq ID No. 19) . The catalytic core begins at base 
pair 981 through base pair 1141 of exon two, base pair 1199 
through base pair 1445 of exon three and base pair 1536 
through base pair 2221 of exon four (Seq ID No. 11). Seq ID 
Nos. 28, 4, 20 and 12 represent the amino acid sequence of the 
CBHII signal sequence, binding domain, linker region and 
catalytic core domain, respectively. 

Figure 3 depicts the genomic DNA and amino acid sequence 
of EGI. The signal sequence begins at base pair 113 and ends 
at base pair 178 (Seq ID No. 29). The catalytic core domain 
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begins at base pair 179 through 882 of exon one, and base pair 
963 through base pair 1379 of the second exon (Seq ID No. 13). 
The linker region begins at base pair 1380 and ends at base 
pair 1460 (Seq ID No. 21). The cellulose binding domain 
5 begins at base pair 14 61 and ends at base pair 1616 (Seq ID 

No. 5). Seq ID Nos. 30, 14, 22 and 6 represent the amino acid 
sequence of EGI signal sequence, catalytic core domain, linker 
region and binding domain, respectively. 

Figure 4 depicts the genomic DNA and amino acid sequence 
10 of EGII. The signal sequence begins at base pair 262 and ends 
at base pair 324 (Seq ID No. 31). The cellulose binding 
domain begins at base pair 325 and ends at base pair 432 (Seq 
ID No. 7). The linker region begins at base pair 433 and ends 
at base pair 534 (Seq No. 23). The catalytic core domain 
15 begins at base pair 535 through base pair 590 in exon one, and 
base pair 765 through base pair 1689 in exon two (Seq ID No. 
15). Seq ID Nos. 32, 8, 24 and 16 represent the amino acid 
sequence of EGII signal sequence, binding domain, linker 
region and catalytic core domain, respectively. 
20 Figure 5 depicts the genomic DNA and amino acid sequence 

of EGIII. The signal sequence begins at base pair 151 and 
ends at base pair 198 (Seq ID No. 35). The catalytic core 
domain begins at base pair 199 through base pair 557 in exon 
one, base pair 613 through base pair 833 in exon two and base 
25 pair 900 through base pair 973 in exon three (Seq ID No. 33). 
Seq ID Nos. 36 and 34 represent the amino acid sequence of 
EGIII signal sequence and catalytic core domain, respectively. 

Figure 6 illustrates the construction of EGI core domain 
expression vector (Seq ID No. 37) . 
30 Figure 7 depicts the construction of the expression 

plasmid pTEX (Seq ID Nos. 39-41). 

Figure 8 is an illustration of the construction of CBHI 
core domain expression vector (Seq ID No. 38) . 

Figure 9 is an illustration of the construction of the 
35 CBHII core domain expression vector. 

Figure 10 illustrates abrasion/redeposition results 
obtained which compare EGI with EGI core and EGII with EGII 
core. 

- 8 - 
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Figure 11 illustrates the improvement in 
abrasion/redeposition results of EGIII treated denim when CBHI 
core is added. 

DETAILED DESCRIPTION OF THE INVENTION 
I*. Definitions 

"Cotton-containing fabric" means sewn or unsewn fabrics 
made of pure cotton or cotton blends including cotton woven 
fabrics, cotton knits, cotton denims, cotton yarns and the 
like. When cotton blends are employed, the amount of cotton 
in the fabric should be at least about 40 percent by weight 
cotton; preferably, more than about 60 percent by weight 
cotton; and most preferably, more than about 75 percent by 
weight cotton. When employed as blends, the companion 
material employed in the fabric can include one or more non- 
cotton fibers including synthetic fibers such as polyamide 
fibers (for example, nylon 6 and nylon 66), acrylic fibers 
(for example, polyacrylonitrile fibers), and polyester fibers 
(for example, polyethylene terephthalate) , polyvinyl alcohol 
fibers (for example, Vinylon) , polyvinyl chloride fibers, 
polyvinylidene chloride fibers, polyurethane fibers, polyurea 
fibers and aramid fibers. 

"Cellulose containing fabric" means any cotton or non- 
cotton containing cellulosic fabric or cotton or non-cotton 
containing cellulose blend including natural cellulosics and 
manmade cellulosics (such as jute, flax, ramie, rayon, 
TENCEL™) . Included under the heading of manmade cellulose 
containing fabrics are regenerated fabrics that are well known 
in the art such as rayon. Other manmade cellulose containing 
fabrics include chemically modified cellulose fibers (e.g, 
cellulose derivatized by acetate) and solvent-spun cellulose 
fibers (e.g. lyocell) . Of course, included within the 
definition of cellulose containing fabric is any garment or 
yarn made of such materials. Similarly, "cellulose containing 
fabric" includes textile fibers made of such materials. 

"Treating composition" means a composition comprising a 
truncated cellulase component which may be used in treating a 
cellulose containing fabric. Such treating includes, but is 
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not limited to, stonewa shying, modifying- the- texture., -feel 
and/or appearance of cellulose containing fabrics or other 
techniques used during manufacturing of cellulose containing 
fabrics. Additionally, treating within the context of this 
5 invention contemplates the removal of "dead cotton", from 
cellolosic fabric or fibers, i.e. immature cotton which is 
significantly more amorphous than mature cotton. Dead cotton 
is known to cause uneven dyeing. Additionally, "treating 
composition" means a composition comprising a truncated 
10 cellulase component which may be used in washing of a soiled 
manufactured cellulose containing fabric. For example, 
truncated cellulase may be used in a detergent composition for 
washing laundry. Detergent compositions useful in accordance 
with the present invention include special formulations such 
15 as pre-wash, pre-soak and home-use color restoration 

compositions. Treating compositions may be in the form of a 
concentrate which requires dilution or in the form of a dilute 
solution or form which can be applied directly to the 
cellulose containing fabric. 
20 It is Applicants' present belief that the action pattern 

of cellulase upon cellulose containing fabrics does not differ 
significantly whether used as a stonewashing composition 
during manufacturing or during laundering of a soiled 
manufactured cellulose containing fabric. Thus, improved 
25 properties such as abrasion, redeposition of dye, strength 
loss and improved feel conferred by a certain cellulase or 
mixture of cellulases are obtained in both detergent and 
manufacturing processes incorporating cellulase. Of course, 
the formulations of specific compositions for the various 
30 textile applications of cellulase, e.g., stonewashing or 

laundry detergent or pre-soak, may differ due to the different 
applications to which the respective compositions are 
directed, as indicated herein. However, the improvements 
effected by the addition of cellulase compositions will be 
35 generally consistent through each of the various textile 
applications. 

••Stonewashing composition" means a formulation for use in 
stonewashing cellulose containing fabrics. Stonewashing 

- 10 - 
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compositions are used to modify cellulose containing fabrics 
prior to presentation for consumer sale, i.e., during the 
manufacturing process, in contrast to detergent compositions 
which are intended for the cleaning of soiled garments. 

"Stonewashing" means the treatment of colored cellulose 
containing fabric with a cellulase solution under agitating 
and cascading conditions, i.e., in a rotary drum washing 
machine, which impart a "stonewashed" appearance to the denim. 
Methods for imparting a stonewashed appearance to denim are 
described in U.S. Patent No. 4,832,864 which is incorporated 
herein by reference in its entirety. Generally, stonewashing 
techniques have been applied to dyed denim. 

"Detergent composition 11 means a mixture which is intended 
for use in a wash medium for the laundering of soiled 
cellulose containing fabrics. In the context of the present 
invention, such compositions may include, in addition to 
cellulases and surfactants, many additives, including, but not 
limited to, additional hydrolytic enzymes, builders, bleaching 
agents, bluing agents and fluorescent dyes, caking inhibitors, 
masking agents, cellulase activators, antioxidants, and 
solubilizers may be included. Such compositions are generally 
used for cleaning soiled garments and are not used during the 
manufacturing process, in contrast to stonewashing 
compositions. 

"Redepositing cellulase" means cellulases which in the 
enzymatic stonewashing or other treatment of cellulose 
containing fabrics using cellulase solutions, particularly 
denim, result in redeposition of dye onto the substrate. This 
effect is often referred to as backstaining. Such 
backstaining of the fabric leads to incomplete stonewashing 
because instead of the desired blue on white contrast, the 
redeposition results in blue on blue. Redepositing cellulases 
include those derived from microorganisms such as the fungal 
microorganism Trichoderma SB^ and the like. In particular, 
ESI , EGII, CBHI and CBHII are known to exhibit significant 
redeposition behavior in their non-truncated state. 

"Surface active agent or surfactant" means anionic, non- 
ionic and ampholytic surfactants well known for their use in 
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detergent compositions. 

"Wash medium" means an aqueous wash solution prepared by 
adding a requisite amount of a detergent composition to water. 
The wash medium generally contains a cleaning effective amount 

of the detergent. 

"Cellulolytic enzymes" or "Cellulase enzymes" means 
fungal exoglucanases or exo-cellobiohydrolases, 
endoglucanases, and 0-glucosidases. These three different 
types of cellulase enzymes act synergistically to convert 
cellulose and its derivatives to glucose. Analysis of the 
genes coding for CBHI, CBHII, EGI, EGII and EGV in Trjchoderma , 
lonaibrachiaturo shows a domain structure comprising a 
catalytic core region or domain (CCD) , a hinge or linker 
region (used interchangeably herein) and cellulose binding 
region or domain (CBD) . 

A cellulase composition produced by a naturally occurring 
source and which comprises one or more cellobiohydrolase type 
and endoglucanase type components wherein each of these 
components is found at the ratio produced by the source is 
sometimes referred to herein as a "complete cellulase system" 
or a "complete cellulase composition" to distinguish it from 
the classifications and components of cellulase isolated 
therefrom, from incomplete cellulase compositions produced by 
bacteria and some fungi, from a cellulase composition obtained 
from a microorganism genetically modified so as to 
overproduce, underproduce, or not produce one or more of the 
cellobiohydrolase type and/or endoglucanase type components of 
cellulase, or from a truncated cellulase enzyme composition, 

as defined herein. 

The present invention specifically contemplates 
applicability to cellulases which contain core and binding 
domain regions as defined herein. For example, bacterial 
cellulases from T h f ™ ft " QSPO ™ so.. Cellu)ompnas sp. , Bacillus 

^nrinmonas sc. ■ str ops nmvces sp - are known to possess 
both a binding domain region and a core region. 

Preferred for use in this invention are fungal 
cellulases. More preferably, the fungal cellulases are 
derived from TT A "hndarma sp. . including Trichoderma 
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lonctibrachiatum. Trichodgrma vir^ e. Trlchoderma *ftn , ?i mii, 
Penicillium so.. Humicola. ?p , , including Humicola i n C o ] ?n ff| 
ASPgrqil l ys sp,, and Fumariuin r P . as used herein, the term 
" Trich . pd . enna " or "Tri anode™* RP| «. refers to any fungal 
strains which have previously been classified as Triehedenn* 
or which are currently classified as Trichod*™,, . „ost 
preferably, the cellulase's derived from Trichoderum 

IgnqibrflChjattiffl or Trichoder roa viridft . 

"Fungal cellulase" means an en2yme composition derived 
from fungal sources or microorganisms genetically modified so 
as to incorporate and express all or part of the cellulase 
genes obtained from a fungal source. Fungi capable of 
producing cellulases useful in preparing cellulase 
compositions described herein are disclosed in British Patent 
No. 2 094 826A, the disclosure of which is incorporated herein 
by reference. 

Most fungal cellulases generally have their optimum 
activity in the acidic or neutral pH range although some 
fungal cellulases are known to possess significant activity 
under neutral and slightly alkaline conditions, i.e., for 
example, cellulase derived from Humicola insm»n f is knovm to 
have activity in neutral to slightly alkaline conditions. 

Fungal cellulases are known to be comprised of several 
enzyme classifications having different substrate specificity, 
enzymatic action patterns, and the like. Additionally, enzyme 
components within each classification can exhibit different 
molecular weights, different degrees of glycosylation, 
different isoelectric points, different substrate specificity 
etc. For example, fungal cellulases can contain cellulase 
classifications which include endoglucanase type components 
(hereinafter "EG-type") , exo-cellobiohydrolase type components 
(hereinafter "CBH-type M ) , 0-glucosidase type components 
(hereinafter "BG-type") , etc. On the other hand, while 
bacterial cellulases are reported in the literature as 
containing little or no CBH-type components, there are a few 
cases where CBH-type components derived from bacterial 

cellulases have been reported to possess exo-cellobiohydrolase 
activity. 



- 13 - 



WO 96/23928 



■ 

PCT/US96/00977 



» 



The fermentation procedures for culturing fungi and for 
production of cellulase are known per se in the art. For 
example, cellulase systems can be produced either by solid or 
submerged culture, including batch, fed-batch and continuous- 
5 flow processes. The collection and purification of the 
cellulase systems from the fermentation broth can also be 
effected by procedures known per se in the art. 

"Endoglucanase-type components" or "EG-type" means fungal 
cellulase components or a combination of components which 

10 exhibit textile activity properties similar to the 

endoglucanase components of TT-ichoderma lonaibrachiatum 
(previously classified as Trichoderraa reesei) . In this 
regard, the endoglucanase components of TrjchOderma, 
1 frprHhr achiatum (for example, EGI, EGII, EGIII, and EGV, 

15 either alone or in combination) are known to impart improved 
feel, improved appearance, softening, color enhancement, 
and/ or a stonewashed appearance to denim fabrics (as compared 
to the fabric prior to treatment) when these components are 
incorporated into a textile treatment medium and the fabric is 

20 treated with this medium. 

Accordingly, endoglucanase type components are those 
cellulase components which impart specific enhancements to 
cellulose containing fabrics, such as improved feel, improved 
appearance, softening, color enhancement, and/or a stonewashed 
25 appearance (as compared to the fabric before treatment) when 
these components are incorporated into a medium used to treat 
the fabrics. Certain EG type components, including EGI, EGII 
and EGIII, may impart reduced strength loss to denim fabrics 
as compared to the strength loss arising from treatment with a 
30 similar cellulase composition but which additionally contains 
CBH I type components. 

Endoglucanase type components as defined herein may not 
include components traditionally classified as endoglucanases 
using activity tests such as the ability of the component (a) 
35 to hydrolyze soluble cellulose derivatives such as 

carboxymethylcellulose (CMC) , thereby reducing the viscosity 
of CMC containing solutions, (b) to readily hydrolyze hydrated 
forms of cellulose such as phosphoric acid swollen cellulose 



- 14 - 



WO 96/23928 



PCT/US96/00977 



(e.g., Walseth cellulose) and hydrolyze less readily the more 
highly crystalline forms of cellulose (e.g., Avicel, 
Solkafloc, etc.). On the other hand, it is believed that not 
all endoglucanase components, as defined by such activity 
tests, will impart one or more of the enhancements to 
cellulose containing fabrics, including reduced strength loss. 
Accordingly, for the purposes herein, it is appropriate to 
define endoglucanase type components as those components of 
fungal cellulase which possess similar textile modification 
properties as possessed by the endoglucanase components of 
TrjcfcojsrflE lonoibrachiatum. 

The different components generally have different 
properties, such as isoelectric point, molecular weight, 
degree of glycosylation, substrate specificity and enzymatic 
action patterns. The different isoelectric points of the 
components allow for their separation via techniques such as 
ion exchange chromatography. In fact, the isolation of 
components from different sources is known in the art. See, 
for example, Bjork et al., U.S. Patent No. 5,120,463; Schulein 
et al.. International Application WO 89/09259; Wood et al., 

BiPShemistrv and Genetics of cellulo se Degradation . pp . 31-52 

(1988); Wood et al., Carbohydr ate Research , Vol. 190, pp. 279- 
297 (1989); Schulein, Methods i n Enzymoloov . Vol. 160, pp. 
234-242 (1988); and the like. The entire disclosure of each 
of these references is incorporated herein by reference. 

The term "EG 1 11 refers to an endoglucanase type component 
typically derived from, or embodying the identifying 
characteristics of those derived from, EG I of Trichoderma s Pt 
Thus, EGl refers to an endoglucanase derived from Trichoderma 
so. characterized by a pH optimum of about 4.0 to 6.0, an 
isoelectric point (pi) of from about 4.5 to 4.7, and a 
molecular weight of about 47 to 49 Kdaltons. Preferably, EG I 
is derived from either Trichode rma lonaibrachiatum or from 
Trichoderma viride . EG I derived from Trichoderma 
lonaibr achiatum has a pH optimum of about 5.0, an isoelectric 
point (pi) of about 4.7 and a molecular weight of about 4 7 to 
49 Kdaltons. EG I cellulase derived from Trichoderma viride 
has a pH optimum of about 5.0, an isoelectric point (pi) of 
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about 5.3 and a molecular weight of about 50 Kdaltons. 

The term "EGII" as defined herein refers to an 
endoglucanase type component typically derived from, or 
embodying the identifying characteristics of those derived 
from, EGII of TxighaflfigBfi so. It is noted that EGII has been 
previously referred to by the nomenclature "EGIII" by some 
authors but current nomenclature uses the term EGII. In any 
event, the EGII protein defined herein is substantially 
different from the EGIII protein in its molecular weight, pi 
and pH optimum. The term "EGII cellulase" refers to the 
endoglucanase component derived from TrichQderma spp . 
characterized by a pH optimum of about 4.0 to 6.0 an 
isoelectric point (pi) of from about 5.5, and a molecular 
weight of about 48 Kdaltons. Preferably, EGII cellulase is 
derived from either Tr ifihoder ^ a lonoi orach i a turn or from 

TT-ichoc ^rpa viride. 

The term "EGIII cellulase" as defined herein refers to an 
endoglucanase type component typically derived from, or 
embodying the identifying characteristics of those derived 
from, EGIII of Tr ^™*«™« so. Thus, EGIII refers to the 
endoglucanase component derived from Trichoderma spp . 
characterized by a pH optimum of about 5.0 to 7.0, an 
isoelectric point (pi) of from about 7.2 to 8.0, and a 
molecular weight of about 23 to 28 Kdaltons. Preferably, 
EGIII cellulase is derived from either Trjchpderma 
^P^v^hiatum or from Tricnort»rm? v^ide. EGIII cellulase 
derived from Tri^»™ a loncn brachial has a pH optimum of 
about 5.5 to 6.0, an isoelectric point (pi) of about 7.4 and a 
molecular weight of about 25 to 28 Kdaltons. EGIII cellulase 
derived from T rj~tr*-™» viride has a pH optimum of about 5.5, 
an isoelectric point (pi) of about 7.7 and a molecular weight 

of about 23.5 Kdaltons. 

The term "EGV cellulase" as defined herein refers to an 
endoglucanase type component typically derived from, or 
embodying the identifying characteristics of those derived 
from, EGV of Tr ^hortPr ma so. Thus EGV refers to the 
endoglucanase component derived from Trichodgrma gp. 
characterized by Saloheimo et al., Malecular Microbiology 
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13(2) :219-228 (1994); and Proceedings of the Second TRICEL 
Symposium on Trichoderma reesei Cellulases and other 
Hydrolases, Esppo Finland 1993, ed. P. Suominen & J.T. 
Reinikainen. Foundation for Biotechnical and Industrial 
Fermentation Research 8, pp. 139-146 (1993), 

w Exo-cellobiohydrolase type components" or "CBH-type" 
means fungal cellulase components which exhibit textile 
activity properties similar to CBH I and/or CBH II cellulase 
components of Trichoderma lonaibr nchiatum . in this regard, 
when used in the absence of EG type cellulase components (as 
defined above) , CBH I and CBH II components of Trichoderma 
lPnqjfrrachiiPtMin alone are recognized as not imparting any 
significant enhancements in feel, appearance, softening color 
enhancement and/or stonewashed appearance to denim fabrics. 
Additionally, when used in combination with some EG type 
components, in a ratio of approximately 2.5:1 of CBH I to EG 
components, the CBH I component of Trichoderm a lonaibraeh^t tin 
imparts enhanced strength loss to the denim fabrics. 

Accordingly, CBH I type components and CBH II type 
components refer to those fungal cellulase components which 
exhibit textile activity properties similar to CBH I and CBH 
II components of Trichoderma loncri brachiatum r respectively. 
As noted above, for CBH I type components, this includes the 
property of increasing strength loss in denim fabrics treated 
with CBH I in the presence of EG type components. 

Exo-cellobiohydrolase type components defined herein may 
not include components traditionally classified as exo- 
cellobiohydrolases using activity tests such as those used to 
characterize CBH I and CBH II from TrichoderTn* 
lonqibrachiatum . For example, exo-cellobiohydrolase type 
components (a) are competitively inhibited by cellobiose (K t 
approximately lmM) ; (b) are unable to hydrolyze substituted 
celluloses to any significant degree, such as 
carboxymethylcellulose, and (c) hydrolyze phosphoric acid 
swollen cellulose and to a lesser degree highly crystalline 
cellulose. On the other hand, it is believed that some 
cellulase components which are characterized as CBH type 
components by these activity tests, will impart improved feel, 



WO 96/23928 



PCT/US96/00977 



appearance, softening, color enhancement, and/or a stonewashed 
appearance to cellulose-containing fabrics when used alone in 
the cellulase composition. Accordingly, it is believed to be 
more accurate for the purposes herein to define such 
exo-cellobiohydrolases as EG-type components because these 
components possess similar functional properties in textile 
uses to those possessed by the endoglucanase components of 

Trichoderma lppqjfrrflchiatvro* 

In general, cellulase compositions comprising the various 
components of a complete cellulase composition ("whole 
cellulase 91 can be obtained by purification techniques based on 
their known characteristics and properties. Specifically, the 
whole cellulase can be purified into substantially pure 
components by recognized separation techniques published in 
the literature, including ion exchange chromatography at a 
suitable pH, affinity chromatography, size exclusion and the 
like. For example, in ion exchange chromatography (usually 
anion exchange chromatography) , it is possible to separate the 
cellulase components by eluting with a pH gradient, or a salt 
gradient, or both a pH and a salt gradient. After 
purification, the requisite amount of the desired components 

could be recombined. 

The term "truncated cellulase", as used herein, refers to 
a protein comprising a truncated cellulase catalytic core or 
truncated cellulose binding domain of exo-cellobiohydrolase or 
endoglucanase, for example, EG I type, EGII type, EGV type, 
CBHI type, CBHII type, or derivatives thereof. As stated 
above, many cellulase enzymes are believed to be bifunctional 
in that they contain domains which are directed toward both 
catalytic or hydrolytic activity with respect to the cellulose 
substrate, and also non-catalytic cellulose binding activity. 
Thus, a truncated cellulase is a cellulase which in an intact 
form, contains both a core and a binding domain but which is 
treated so as to lack one or the other domain. 

The catalytic core and the cellulose binding domain of a 
cellulase enzyme are believed to act together in a synergistic 
manner to effect efficient and often deleterious hydrolysis of 
cellulose fibers in a cellulose containing fabric. Further, 
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cellulase catalytic activity and cellulose binding activity 
are believed to be specific to distinct structural domains. 
For example, as indicated above, many cellulase enzymes, 
including cellulases from, for example, T. loncnh^achiatmn »tv* 
Hliffiisalfl Inaalfing are known to incorporate a catalytic core 
domain subunit which is attached via a linker region to a 
cellulose binding domain subunit. 

A "truncated cellulase derivative" encompasses a 
truncated cellulase core or truncated cellulose binding 
domain, as defined herein, wherein there may be an addition or 
deletion of one or more amino acids to either or both of the 
C- and N- terminal ends of the truncated cellulase, or a 
substitution, insertion or deletion of one or more amino acids 
at one or more sites throughout the truncated cellulase. 
Derivatives include mutants that preserve their character as 
truncated cellulase core or truncated cellulose binding 
domain, as defined below. It is also intended by the term 
"derivative of a truncated cellulase" to include core or 
binding domains of the exoglucanase or endoglucanase enzymes 
that have attached thereto one or more amino acids from the 
linker region. 

A truncated cellulase derivative further refers to a 
protein substantially similar in structure and biological 
activity to a truncated cellulase core or truncated cellulose 
binding domain protein, but which has been genetically 
engineered to contain a modified amino acid sequence. Thus, 
provided that the two proteins possess substantially similar 
activity, they are considered "derivatives" even if the 
primary structure of one protein does not possess the 
identical amino acid sequence to that found in the other. 

It is contemplated that a truncated cellulase derivative 
may be derived from a DNA fragment encoding a catalytic 
truncated core or a truncated cellulose binding domain which 
further contains an addition of one or more nucleotides 
internally or at the 5' or 3' end of the DNA fragment, a 
deletion of one or more nucleotides internally or at the 5' or 
3' end of the DNA fragment or a substitution of one or more 
nucleotides internally or at the 5' or 3' end of the DNA 
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fragment wherein the functional activity of the expressed 
truncated cellulose binding or catalytic core domain 
(truncated cellulase derivative) is retained. A DNA fragment 
encoding a cellulase catalytic core or cellulose binding 
domain may further include a linker or hinge DNA sequence or 
portion thereof attached to the core or binding domain DNA 
sequence at either the 5' or 3' end wherein the functional 
activity of the encoded truncated cellulose binding domain or 
cellulase core domain (truncated cellulase derivative) is 
retained. 

The term "truncated cellulase core" or "truncated 
cellulase region" refers herein to a peptide comprising the 
catalytic core domain of exo-cellobiohydrolase or 
endoglucanase, for example, EGI type, EGII type, EGV type, 
CBHI type or CBHII type, or a derivative thereof that is 
capable of enzymatically cleaving cellulose polymers, 
including but not limited to pulp or phosphoric acid swollen 
cellulose. However, a truncated cellulase core will not 
possess cellulose binding activity attributable to a cellulose 
binding domain. A truncated cellulase core is distinguished 
from a non-truncated cellulase which, in an intact form, 
possesses poor cellulose binding activity. A truncated 
cellulase core may include other entities which do not include 
cellulose binding activity attributable to a cellulose binding 
domain. For example, the presence of a linker or hinge is 
specifically contemplated. Similarly, the covalent attachment 
of another enzymatic entity to the truncated cellulase core is 
also specifically contemplated. 

The performance (or activity) of a protein containing a 
truncated catalytic core or a derivative thereof may be 
determined by methods well known in the art. (See Wood, T.H. 
et al in Methods in Enzymology, Vol. 160, Editors: Wood, W.A. 
and Kellogg, S.T. , Academic Press, pp. 87-116, 1988) For 
example, such activities can be determined by hydrolysis of 
phosphoric acid-swollen cellulose and/or soluble 
oligosaccharides followed by quantification of the reducing 
sugars released. In this case the soluble sugar products, 
released by the action of CBH or EG cellulase core domains or 
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derivatives thereof, can be detected by HPLC analysis or by 
use of colorimetric assays for measuring reducing sugars. It 
is expected that these catalytic domains or derivatives 
thereof will retain at least 10% of the activity exhibited by 
the intact enzyme when each is assayed under similar 
conditions and dosed based on similar amounts of catalytic 
domain protein. 

The term "truncated cellulose binding domain" refers 
herein to a peptide or group of related peptides comprising 
cellulose binding activity of an exo-cellobiohydrolase or an 
endoglucanase, for example, EGI type, EGII type, EGV type, 
CBHI type or CBHII type, or a derivative thereof that non- 
covalently binds to a polysaccharide such as cellulose. It is 
believed that cellulose binding domains attach the enzyme to 
cellulose and function independently from the catalytic core 
of the cellulase enzyme. A truncated cellulose binding domain 
will not possess the significant hydrolytic activity 
attributable to a catalytic core. A truncated cellulose 
binding domain is distinguished from a non-truncated cellulase 
which, in an intact form, possesses no cellulase catalytic 
core. A truncated cellulose binding domain may include other 
entities which do not include cellulose cleavage activity 
attributable to cellulase catalytic core. For example, the 
presence of a linker or hinge is specifically contemplated. 
Similarly, the covalent attachment of another enzymatic entity 
to the truncated cellulose binding domain is also specifically 
contemplated. 

The performance (or activity) of a truncated cellulose 
binding domain or derivatives thereof as described in the 
present invention may be determined by cellulose binding 
assays using cellulose substrates such as avicel, pulp or 
cotton. It is expected that these novel truncated cellulose 
binding domains or derivatives thereof will retain at least 
10% of the binding affinity compared to that exhibited by the 
non-truncated enzyme when each is assayed under similar 
conditions and dosed based on similar amounts of binding 
domain protein. The amount of non-bound binding domain may be 
quantified by direct protein analysis, by chromatographic 



WO 96/23928 



PCT/US96/00977 



i 



methods, or possibly by immunological methods. 

Other methods well known in the art that measure 
cellulase catalytic and/or binding activity via the physical 
or chemical properties of particular treated substrates may 
5 also be suitable in the present invention. For example, for 
methods that measure physical properties of a treated 
substrate, the substrate is analyzed for modification of 
shape, texture, surface, or structural properties, 
modification of the "wet" ability, e.g. substrates ability to 
10 absorb water, or modification of swelling. Other parameters 
which may determine activity include the measuring of the 
change in the chemical properties of treated solid substrates. 
For example, the diffusion properties of dyes or chemicals may 
be examined after treatment of solid substrate with the 
15 truncated cellulase binding protein or derivatives thereof 
described in the present invention. Appropriate substrates 
for evaluating activity include Avicel, rayon, pulp fibers, 
cotton or ramie fibers, paper, kraft or ground wood pulp, for 
example. (See also Wood, T.M. et al in "Methods in 
20 Enzymology" , Vol. 160, Editors: Wood, W.A. and Kellogg, S.T. , 
Academic Press, pp. 87-116, (1988)). 

In addition to truncated cellulase core, naturally 
occurring cellulases which lack a binding domain possess the 
advantages of the truncated catalytic cores defined herein. 
25 For example, bacterial cellulase derived from Erwinlfr 

carotovora are believed to possess no binding domain, (see 
e.g., Saarilahti, et al. Gene, Vol. 90, pp. 9-14 (1990)). 
Similarly, cellulases derived from Clostrid ium thermocellum. 
also are believed to possess no binding domain (see e.g. 
30 Gilkes, et al., Microbiological Reviews, pp. 303-315 (1991)). 
Thus, the use of naturally occurring cellulases having no 
binding domain will provide decreased backstaining at 
equivalent levels of abrasion to known cellulases. 

f, fl-Glucosidase (BG) components" means components of a 
35 complete cellulase composition which exhibit BG activity; that 
is to say that such components will act from the non-reducing 
end of cellobiose and other soluble cellooligosaccharides 
("cellobiose") and give glucose as the sole product. BG 
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components do not adsorb onto or react with cellulose 
polymers. Furthermore, such BG components are competitively 
inhibited by glucose (K f approximately ImM) . While in a strict 
sense, BG components are not literally cellulases because they 
cannot degrade cellulose, such BG components are included 
within the definition of the cellulase system because these 
enzymes facilitate the overall degradation of cellulose by 
further degrading the inhibitory cellulose degradation 
products (particularly cellobiose) produced by the combined 
action of CBH type components and EG type components. 

Methods to either increase or decrease the amount of BG 
components in the cellulase composition is disclosed in WO 
92/10581 which application is incorporated herein by reference 
in its entirety, 

"Linker or hinge region" means a short peptide region 
that links together structurally distinct catalytic core and 
cellulose binding domains of a cellulase. These domains in 
lPPqjfrrgchifltVlQ cellulases, for example, are linked by a 
peptide rich in Ser, Thr and Pro. 

"Signal sequence" means a sequence of amino acids bound 
to the N-terminal portion of a protein which facilitates the 
secretion of the mature form of the protein outside of the 
cell. This definition of a signal sequence is a functional 
one. The mature form of the extracellular protein lacks the 
signal sequence which is cleaved off during the secretion 
process . 

"Host cell" means a cell which acts as a host for a 
recombinant DNA vector according to the present invention. 
In a preferred embodiment according to the present invention, 
"host cell" means both the cells and protoplasts created from 
the cells of Trichoderma sd. 

"DNA construct or vector" (used interchangeably herein) 
means a nucleotide sequence which comprises one or more DNA 
fragments or DNA variant fragments encoding any of the novel 
truncated cellulases or derivatives described above. 

"Functionally attached to" means that a regulatory 
region, such as a promoter, terminator, secretion signal or 
enhancer region is attached to a structural gene and controls 
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the expression of that gene. 

"Buffer" means art recognized acid/base reagents which 
stabilize the cellulase solution against undesired pH shifts 
during cellulase treatment of the cellulose containing fabric. 
In this regard, it is art recognized that cellulase activity 
is pH dependent. For example, a specific cellulase 
composition will exhibit cellulolytic activity within a 
defined pH range with optimal cellulolytic activity generally 
being found within a small portion of this defined range. The 
specific pH range for cellulolytic activity will vary with 
each cellulase composition. As noted above, while most 
cellulases will exhibit cellulolytic activity within an acidic 
to neutral pH profile, there are some cellulase compositions 
which exhibit cellulolytic activity in an alkaline pH 
environment. 

During cellulase treatment of the cellulose containing 
fabric, it is possible that the pH of the initial cellulase 
solution could be outside the range required for cellulase 
activity. It is further possible for the pH to change during 
treatment of the cellulose containing fabric, for example, by 
the generation of a reaction product which alters the pH of 
the solution. In either event, the pH of an unbuffered 
cellulase solution could be outside the range required for 
cellulolytic activity. When this occurs, undesired reduction 
or cessation of cellulolytic activity in the cellulase 

solution occurs. 

In view of the above, the pH of the cellulase solution 
should be maintained within the range required for 
cellulolytic activity. One means of accomplishing this is by 
simply monitoring the pH of the system and adjusting the pH as 
required by the addition of either an acid or a base. 
However, in a preferred embodiment, the pH of the system is 
preferably maintained within the desired pH range by the use 
of a buffer in the cellulase solution. In general, a 
sufficient amount of buffer is employed so as to maintain the 

■ 

pH of the solution within the range wherein the employed 
cellulase exhibits activity. Insofar as different cellulase 
compositions have different pH ranges for exhibiting cellulase 
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activity, the specific buffer employed is selected in 
relationship to the specific cellulase composition employed. 
The buffer (s) selected for use with the cellulase composition 
employed can be readily determined by the skilled artisan 
taking into account the pH range and optimum for the cellulase 
composition employed as well as the pH of the cellulase 
solution. Preferably, the buffer employed is one which is 
compatible with the cellulase composition and which will 
maintain the pH of the cellulase solution within the pH range 
required for optimal activity. Suitable buffers include 
sodium citrate, ammonium acetate, sodium acetate, disodium 
phosphate, and any other art recognized buffers. 

IL. Preparation of Truncated Ce llulase Engymg*; 

The present invention relates to the use of truncated 
cellulases and derivatives of truncated cellulases. These 
enzymes are preferably prepared by recombinant methods. 
Additionally, truncated cellulase proteins for use in the 
present invention may be obtained by other art recognized 
means such as chemical cleavage or proteolysis of complete 
cellulase protein. 

A preferred mode for preparing truncated cellulase 
according to the present invention comprises obtaining a 
modified Trichoderma sp. host cell that is missing one or more 
cellulase activities. The modified Trichoderma sp. cell is 
transformed with a DNA construct comprising at least a 
fragment of DNA encoding a portion or all of the binding or 
core region of a exo-cellobiohydrolase or an endoglucanase, 
for example, EGI, EGII, EGV, CBHI or CBHII functionally 
attached to a promoter. The transformed host cell is then 
grown under conditions so as to express the desired protein. 
Subsequently, the desired protein product is purified to 
substantial homogeneity. 

Preferably, the microorganism to be transformed comprises 
a strain derived from Trichoderma sp. More preferably, the 
strain comprises T, lonqibrachiatum cellulase over-producing 
strain. For example, RL-P37, described by Sheir-Neiss et al. 
in Appl. Microbiol. Biotechnology, 20 (1984) pp. 46-53 is 
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known to secrete elevated amounts of cellulase enzymes. 
Functional equivalents of RL-P37 include Trichoderma 
lonaibrachiatum strain RUT-C30 (deposited under deposit no, 
ATCC No. 56765} and ATCC deposit nos. 58351, 58352, and 58353. 
5 Still more preferably, the Trichodermg host cell strains 

have had one or more cellulase genes deleted prior to 
introduction of a DNA construct or plasmid containing the DNA 
fragment encoding the truncated cellulase protein. Such 
strains may be prepared by the method disclosed in U.S. Patent 

10 No. 5,246,853 and WO 92/06209, which disclosures are hereby 
incorporated by reference. By expressing a truncated 
cellulase in a host microorganism that is missing one or more 
cellulase genes, the identification and subsequent 
purification procedures are simplified. Any gene from 

15 Trichoderma sp. which has been cloned can be deleted, for 
example, the cbhl . eall , and eg 13 genes • 

Gene deletion may be accomplished by inserting a form of 
the desired gene to be deleted or disrupted into a plasmid by 
methods known in the art. An appropriate deletion plasmid 

20 will generally have unique restriction enzyme sites present 
therein to enable the fragment of homologous Trichoderma so. 
DNA to be removed as a single linear piece. The deletion 
plasmid is then cut at an appropriate restriction enzyme 
site(s), internal to the coding region, and the gene coding 

25 sequence or part thereof replaced with a selectable marker. 
Flanking DNA sequences from the locus of the gene to be 
deleted or disrupted, preferably between about 0.5 to 2.0 kb, 
remain on either side of the selectable marker gene. 

A selectable marker must be chosen so as to enable 

30 detection of the transformed fungus. Any selectable marker 

gene which is expressed in the selected microorganism will be 
suitable. For example, with Trichoderma sp. . the selectable 
marker is chosen so that the presence of the selectable marker 
in the transf ormants will not significantly affect the 

35 properties thereof. Such a selectable marker may be a gene 

which encodes an assayable product. For example, a functional 
copy of a Trichoderma sp gene may be used which if lacking in 
the host strain results in the host strain displaying an 
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auxotrophic phenotype. 

In a preferred embodiment, a pvr4 1 derivative strain of 
Trichoderma sd« is transformed so that one or more cellulase 
genes are replaced by the pvr4 gene, thus providing a 
selectable marker. A pyr4 1 derivative strain may be obtained 
by subjecting Trichoderma bp. strains to fluoroorotic acid 
(FOA) . The ovr4 gene encodes orotidine-5' -monophosphate 
decarboxylase, an enzyme required for the biosynthesis of 
uridine. Strains with an intact pvr4 gene grow in a medium 
lacking uridine but are sensitive to fluoroorotic acid. It is 
possible to select pvr4- derivative strains which lack a 
functional orotidine monophosphate decarboxylase enzyme and 
require uridine for growth by selecting for FOA resistance. 
Using the FOA selection technique it is also possible to 
obtain uridine requiring strains which lack a functional 
orotate pyrophosphoribosyl transferase. It is possible to 
transform these cells with a functional copy of the gene 
encoding this enzyme (Berges and Barreau, 1991, Curr. Genet. 
19 pp359-365) . Selection of derivative strains is easily 
performed using the FOA resistance technique referred to 
above, and thus, the pyr4 gene is preferably employed as a 
selectable marker. 

To transform ovr4- Trichoderma sp. so as to be lacking in 
the ability to express one or more cellulase genes, a single 
DNA fragment comprising a disrupted or deleted cellulase gene 
is then isolated from the deletion plasmid and used to 
transform an appropriate pyr " Trichoderma host. Transf ormants 
are then identified and selected based on their ability to 
express the pyr4 gene product and thus compliment the uridine 
auxotrophy of the host strain. Southern blot analysis is then 
carried out on the resultant transformants to identify and 
confirm a double cross over integration event which replaces 
part or all of the coding region of the gene to be deleted 
with the pvr4 selectable markers. 

Although the specific plasmid vectors described above 
relate to preparation of pyr^ transformants, the present 
invention is not limited to these vectors. Various genes can 
be deleted and replaced in the Trichoderma sd. strain using 
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the above techniques. In addition, any available selectable 
markers can be used, as discussed above. In fact, any 
Trichoderma sp. gene which has been cloned, and thus 
identified, can be deleted from the genome using the above- 
described strategy. 

As stated above, the host strains used are derivatives of 
Trichoderma so. which lack or have a nonfunctional gene or 
genes corresponding to the selectable marker chosen. For 
example, if the selectable marker of ovr4 is chosen, then a 
specific pvr derivative strain is used as a recipient in the 
transformation procedure. Similarly, selectable markers 
comprising Trichoderma sp. genes equivalent to the Aspergillus 
nidulans genes araB. trp C, niaD may be used. The 
corresponding recipient strain must therefore be a derivative 
strain such as araB-. troC 1 . niaD 1 . respectively. 

DNA encoding the truncated cellulase protein is then 
prepared for insertion into an appropriate microorganism. 
According to the present invention, DNA encoding for a 
truncated cellulase enzyme comprises DNA encoding for a 
protein which corresponds to the catalytic core region of the 
cellulase enzyme. Accordingly, DNA may be derived from any 
microbial source which is known to produce cellulase, where 
the gene is identified and isolated. In a preferred 
embodiment, the DNA encodes for a truncated cellulase protein 
derived from Trichoderma so. , and more preferably from 
Trichoderma lonqibrachiatum . Thus, the DNA may encode for an 
EGI, EGII, CBHI or CBHII core protein. Preferably, the DNA 
gene fragment or variant DNA fragment codes for the core or 
binding domains of a Trichoderma so. cellulase or derivative 
thereof that additionally retains the functional activity of 
the truncated core or binding domain, respectively. Further, 
the DNA fragment or variant thereof comprising the sequence of 
the core or binding domain regions may additionally have 
attached thereto a linker, or hinge region DNA sequence or 
portion thereof wherein the encoded truncated cellulase still 
retains either cellulase core or binding domain activity, 
respectively. 

The DNA fragment or DNA variant fragment encoding the 
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truncated cellulase or derivative may be functionally attached 
to a fungal promoter sequence, for example, the promoter of 
the cbhl or eall gene. It is also contemplated that more 
than one copy of DNA encoding a truncated cellulase may be 
5 recombined into the strain. 

The DNA encoding the truncated cellulase protein may be 
prepared by the construction of an expression vector carrying 
the DNA encoding the truncated cellulase. The expression 
vector carrying the inserted DNA fragment encoding the 

10 truncated cellulase may be any vector which is capable of 

replicating autonomously in a given host organism, typically a 
plasmid. In preferred embodiments two types of expression 
vectors for obtaining expression of genes or truncations 
thereof are contemplated. The first contains DNA sequences in 

15 which the promoter, gene coding region, and terminator 

sequence all originate from the gene to be expressed. The 
gene truncation is obtained by deleting away the undesired DNA 
sequences (coding for unwanted domains) to leave the domain to 
be expressed under control of its own transcriptional and 

20 translational regulatory sequences. A selectable marker is 
also contained on the vector allowing the selection for 
integration into the host of multiple copies of the novel gene 
sequences. 

For example, pEGIA3'pyr contains the EG I cellulase core 
25 domain under the control of the EG I promoter, terminator, and 
signal sequences. The 3' end on the EG I coding region 
containing the cellulose binding domain has been deleted. The 
plasmid also contains the pyr4 gene for the purpose of 
selection. 

30 The second type of expression vector is preassembled and 

contains sequences required for high level transcription and a 
selectable marker. It is contemplated that the coding region 
for a gene or part thereof can be inserted into this general 
purpose expression vector such that it is under the 

35 transcriptional control of the expression cassettes promoter 
and terminator sequences. 

For example, PTEX is such a general purpose expression 
vector. Genes or part thereof can be inserted downstream of 
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the strong CBHI promoter. The Examples disclosed herein are 
included in which cellulase catalytic core and binding domains 
are shown to be expressed using this system. 

In the vector, the DNA sequence encoding the truncated 
cellulase or other novel proteins of the present invention 
should be operably linked to transcriptional and translational 
sequences, i.e., a suitable promoter sequence and signal 
sequence in reading frame to the structural gene. The 
promoter may be any DNA sequence which shows transcriptional 
activity in the host cell and may be derived from genes 
encoding proteins either homologous or heterologous to the 
host cell. The signal peptide provides for extracellular 
expression of the truncated cellulase or derivatives thereof. 
The DNA signal sequence is preferably the signal sequence 
naturally associated with the truncated gene to be expressed, 
however the signal sequence from any exo-cellobiohydrolases or 
endoglucanase is contemplated in the present invention. 

The procedures used to ligate the DNA sequences coding 
for the truncated cellulases, derivatives thereof or other 
novel cellulases of the present invention with the promoter, 
and insertion into suitable vectors containing the necessary 
information for replication in the host cell are well known in 
the art. 

The DNA vector or construct described above may be 
introduced in the host cell in accordance with known 
techniques such as transformation, transf ection, 
microinjection, microporation, biolistic bombardment and the 

like. 

In the preferred transformation technique, it must be 
taken into account that the permeability of the cell wall in 
Trichoderma so. is very low. Accordingly, uptake of the 
desired DNA sequence, gene or gene fragment is at best 
minimal. There are a number of methods to increase the 

permeability of the Tr ighoderma sp - cel1 wal1 in the 
derivative strain (i.e., lacking a functional gene 
corresponding to the used selectable marker) prior to the 

transformation process. 

The preferred method in the present invention to prepare 
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ICiShadSPaa gp t for transformation involves the preparation of 
protoplasts from fungal mycelium. The mycelium can be obtained 
from germinated vegetative spores. The mycelium is treated 
with an enzyme which digests the cell wall resulting in 
protoplasts. The protoplasts are then protected by the 
presence of an osmotic stabilizer in the suspending medium. 
These stabilizers include sorbitol, mannitol, potassium 
chloride, magnesium sulfate and the like. Usually the 
concentration of these stabilizers varies between 0.8 M to 1.2 
M. It is preferable to use about a 1.2 M solution of sorbitol 
in the suspension medium. 

Uptake of the DNA into the host Trichodermn gp , strain is 
dependent upon the calcium ion concentration. Generally 
between about 10 Mm CaCl 2 and 50 Mm CaCl 2 is used in an uptake 
solution. Besides the need for the calcium ion in the uptake 
solution, other items generally included are a buffering 
system such as TE buffer (10 Mm Tris, Ph 7.4; 1 Mm EDTA) or 10 
Mm MOPS, Ph 6.0 buffer (morpholinepropanesulf onic acid) and 
polyethylene glycol (PEG). It is believed that the 
polyethylene glycol acts to fuse the cell membranes thus 
permitting the contents of the medium to be delivered into the 
cytoplasm of the Trichoderma ?I?T strain and the plasmid DNA is 
transferred to the nucleus. This fusion frequently leaves 
multiple copies of the plasmid DNA tandemly integrated into 
the host chromosome. 

Usually a suspension containing the Trichoderma sp. 
protoplasts or cells that have been subjected to a 
permeability treatment at a density of 10 8 to 10 9 /ml, 
preferably 2 x 10 8 /ml are used in transformation. These 
protoplasts or cells are added to the uptake solution, along 
with the desired linearized selectable marker having 
substantially homologous flanking regions on either side of 
said marker to form a transformation mixture. Generally a 
high concentration of PEG is added to the uptake solution. 
From 0.1 to 1 volume of 25% PEG 4000 can be added to the 
protoplast suspension. However, it is preferable to add about 
0.25 volumes to the protoplast suspension. Additives such as 
dimethyl sulfoxide, heparin, spermidine, potassium chloride 
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and the like may also be added to the uptake solution and aid 
in transformation. 

Generally, the mixture is then incubated at approximately 
0°C for a period between 10 to 30 minutes. Additional PEG is 
then added to the mixture to further enhance the uptake of the 
desired gene or DNA sequence. The 25% PEG 4000 is generally 
added in volumes of 5 to 15 times the volume of the 
transformation mixture; however, greater and lesser volumes 
may be suitable. The 25% PEG 4000 is preferably about 10 
times the volume of the transformation mixture. After the PEG 
is added, the transformation mixture is then incubated at room 
temperature before the addition of a sorbitol and CaCl 2 
solution. The protoplast suspension is then further added to 
molten aliquots of a growth medium. This growth medium 
permits the growth of transf ormants only. Any growth medium 
can be used in the present invention that is suitable to grow 
the desired transf ormants. However, if Pyr 1 transf ormants are 
being selected it is preferable to use a growth medium that 
contains no uridine. The subsequent colonies are transferred 
and purified on a growth medium depleted of uridine. 

At this stage, stable transf ormants were distinguished 
from unstable transf ormants by their faster growth rate and 
the formation of circular colonies with a smooth, rather than 
ragged outline on solid culture medium lacking uridine. 
Additionally, in some cases a further test of stability was 
made by growing the transf ormants on solid non-selective 
medium (i.e. containing uridine), harvesting spores from this 
culture medium and determining the percentage of these spores 
which will subsequently germinate and grow on selective medium 

lacking uridine. 

In a particular embodiment of the above method, the 
truncated cellulases or derivatives thereof are recovered in 
active form from the host cell either as a result of the 
appropriate post translational processing of the novel 
truncated cellulase or derivative thereof. 

The truncated cellulases are recovered from the medium by 
conventional techniques including separations of the cells 
from the medium by centrif ugation, filtration, and 
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precipitation of the proteins in the supernatant or filtrate 
with a salt, for example, ammonium sulphate • Additionally, 
chromatography procedures such as ion exchange chromatography, 
affinity chromatography may be used. Alternatively, the 
secreted protein product may be isolated and purified by 
binding to a polysaccharide substrate or antibody matrix. The 
antibodies (polyclonal or monoclonal) may be raised against 
cellulase core or binding domain peptides, or synthetic 
peptides may be prepared from portions of the core domain or 
binding domain and used to raise polyclonal antibodies. 

The DNA transformed into the host cell may comprise 
additional embodiments. For example, cellulase enzymes are 
contemplated which combine a core region derived from either a 
homologous or heterologous microbial source. 

III^ Methods Of Treating Cellulos e Containing Fabric Using 
Truncated Cellulase Enzvmes 

As noted above, the present invention pertains to methods 
for treating cellulose containing fabrics with a truncated 
cellulase enzyme. The use of the truncated cellulase 
composition of this invention provides the novel and 
surprising result of effecting a relatively low level of dye 
redeposition while maintaining an equivalent level of abrasion 
compared to prior art cellulase treatment. Because the level 
of abrasion acts as an indicator of the quality and 
effectiveness of particular cellulase treatment techniques, 
e.g., stonewashing or laundering, the use of the instant 
invention provides a surprisingly high quality textile 
treatment composition. In the laundering context, abrasion is 
sometimes referred to as color clarification, defuzzing or 
biopolishing. 

The present invention specifically contemplates the use 
of truncated cellulase core, alone or in combination with 
additional cellulase components, to achieve excellent abrasion 
with reduced redeposition when compared to non-truncated 
cellulase. Additionally, naturally occurring cellulase 
enzymes which lack a binding domain are contemplated as within 
the scope of the invention. It is also contemplated that the 
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methods of this invention will provide additional enhancements 
to treated cellulose containing fabric, including improvements 
in the feel and/ or appearance of the fabric. 

5 Methodolog y for Stonewashing With Truncated 

Cellulase Compositions 
According to the present invention, the truncated 
cellulase compositions described above may be employed as a 
stonevashing composition. Preferably, the stonewashing 

10 composition of the instant comprises an aqueous solution which 
contains an effective amount of a truncated cellulase together 
with other optional ingredients including, for example, a 
buffer, a surfactant, and a scouring agent. 

An effective amount of truncated cellulase enzyme 

15 composition is a concentration of truncated cellulase enzyme 
sufficient for its intended purpose. Thus an "effective 
amount" of truncated cellulase in the stonewashing composition 
according to the present invention is that amount which will 
provide the desired treatment, e.g., stonewashing. The amount 

20 of truncated cellulase employed is also dependent on the 
equipment employed, the process parameters employed (the 
temperature of the truncated cellulase treatment solution, the 
exposure time to the cellulase solution, and the like) , and 
the cellulase activity (e.g., a particular solution will 

25 require a lower concentration of cellulase where a more active 
cellulase composition is used as compared to a less active 
cellulase composition) . The exact concentration of truncated 
cellulase can be readily determined by the skilled artisan 
based on the above factors as well as the desired result. 

30 Preferably the truncated cellulase composition is present in a 
concentration of from 1-1000 ppm, more preferably 10-400 ppm 
and most preferably 20-100 ppm total protein. 

Optionally, a buffer is employed in the stonewashing 
composition such that the concentration of buffer is that 

35 which is sufficient to maintain the pH of the solution within 
the range wherein the employed truncated cellulase exhibits 
activity which, in turn, depends on the nature of the 
truncated cellulase employed. The exact concentration of 
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buffer employed will depend on several factors which the 
skilled artisan can readily take into account. For example, 
in a preferred embodiment, the buffer as well as the buffer 
concentration are selected so as to maintain the pH of the 
final truncated cellulase solution within the pH range 
required for optimal cellulase activity. Preferably, buffer 
concentration in the stonewashing composition is about 0.001N 
or greater. Suitable buffers include, for example, citrate 
and acetate. 

In addition to truncated cellulase and a buffer, the 
stonewashing composition may optionally contain a surfactant. 
Preferably, the surfactant is present in a concentration in 
the diluted wash mediums of greater than 100 ppm, preferably 
from about 200-15,000 ppm. Suitable surfactants include any 
surfactant compatible with the cellulase and the fabric 
including, for example, anionic, non-ionic and ampholytic 
surfactants. Suitable anionic surfactants for use herein 
include linear or branched alkylbenzenesulf onates; alkyl or 
alkenyl ether sulfates having linear or branched alkyl groups 
or alkenyl groups; alkyl or alkenyl sulfates; 
olefinsulf onates; alkanesulf onates and the like. Suitable 
counter ions for anionic surfactants include alkali metal ions 
such as sodium and potassium; alkaline earth metal ions such 
as calcium and magnesium; ammonium ion; and alkanolamines 
having 1 to 3 alkanol groups of carbon number 2 or 3. 
Ampholytic surfactants include quaternary ammonium salt 
sulfonates, and betaine-type ampholytic surfactants. Such 
ampholytic surfactants have both the positive and negative 
charged groups in the same molecule. Nonionic surfactants 
generally comprise polyoxyalkylene ethers, as well as higher 
fatty acid alkanolamides or alkylene oxide adduct thereof, and 
fatty acid glycerine monoesters. Mixtures of surfactants can 
also be employed in manners known in the art. 

In a preferred embodiment, a concentrated stonewashing 
composition can be prepared for use in the methods described 
herein. Such concentrates would contain concentrated amounts 
of the truncated cellulase composition described above, buffer 
and surfactant, preferably in an aqueous solution. When so 
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formulated, the stonewashing concentrate can readily be 
diluted with water so as to quickly and accurately prepare 
stonewashing compositions according to the present invention 
and having the requisite concentration of these additives* 
5 Preferably, such concentrates will comprise from about 0.1 to 
about 50 weight percent of a fungal cellulase composition 
described above (protein); from about 0.1 to about 80 weight 
percent buffer; from about 0 to about 50 weight percent 
surfactant; with the balance being water. When aqueous 

10 concentrates are formulated, these concentrates can be diluted 
so as to arrive at the requisite concentration of the 
components in the truncated cellulase solution as indicated 
above. As is readily apparent, such stonewashing concentrates 
will permit facile formulation of the truncated cellulase 

15 solutions as well as permit feasible transportation of the 
concentration to the location where it will be used. The 
stonewashing concentrate can be in any art recognized form, 
for example, liquid, emulsion, gel, or paste. Such forms are 
well known to the skilled artisan. 

20 When a solid stonewashing concentrate is employed, the 

cellulase composition may be a granule, a powder, an 
agglomerate or a solid disk. When granules are used, the 
granules are preferably formulated so as to contain a 
cellulase protecting agent. See, for instance, U.S. Serial 

25 No. 07/642,669, filed January 17, 1991 as Attorney Docket No. 
010055-073 and entitled "GRANULES CONTAINING BOTH AN ENZYME 
AND AN ENZYME PROTECTING AGENT AND DETERGENT COMPOSITIONS 
CONTAINING SUCH GRANULES," which application is incorporated 
herein by reference in its entirety. Likewise, the granules 

30 can be formulated so as to contain materials to reduce the 
rate of dissolution of the granules into the wash medium. 
Such materials and granules are disclosed in U.S. Patent No. 
5,254,283 which is incorporated herein by reference in its 
entirety. 

35 Other materials can also be used with or placed in the 

stonewashing composition of the present invention as desired, 
including stones, pumice, fillers, solvents, enzyme 
activators, and other anti-redeposition agents. 
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The cellulose containing fabric is contacted with the 
stonewashing composition containing an effective amount of the 
truncated cellulase enzyme or derivative by intermingling the 
treating composition with the stonewashing composition, and 
thus bringing the truncated cellulase enzyme into proximity 
with the fabric. For example, if the treating composition is 
an aqueous solution, the fabric may be directly soaked in the 
solution. Similarly, where the stonewashing composition is a 
concentrate, the concentrate is diluted into a water bath with 
the cellulose containing fabric- When the stonewashing 
composition is in a solid form, for example a pre-wash gel or 
solid stick, the stonewashing composition may be contacted by 
directly applying the composition to the fabric or to the wash 
liquor. 

The cellulose containing fabric is incubated with the 
stonewashing solution under conditions effective to allow the 
enzymatic action to confer a stonewashed appearance to the 
cellulose containing fabric. For example, during 
stonewashing, the pH, liquor ratio, temperature and reaction 
time may be adjusted to optimize the conditions under which 
the stonewashing composition acts. "Effective conditions" 
necessarily refers to the pH, liquor ratio, and temperature 
which allow the truncated cellulase enzyme to react 
efficiently with cellulose containing fabric. The reaction 
conditions for truncated cellulase core, and thus the 
conditions effective for the stonewashing compositions of the 
present invention, are substantially similar to well known 
methods used with corresponding non-truncated cellulases. 
Accordingly, the conditions effective for treatment of 
cellulose containing fabric with a stonewashing composition 
comprising CBHI type core according to the present invention 
are substantially similar to those in the prior art using 
wild-type CBHI type cellulase compositions. Similarly, where 
a mixture of truncated and non-truncated cellulase is 
utilized, the conditions should be optimized similar to where 
a similar combination may have been used. Accordingly, it is 
within the skill of those in the art to maximize conditions 
for using the stonewashing compositions according to the 
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present invent ion. 

The liquor ratios during stonewashing, i.e., the ratio of 
weight of stonewashing composition solution (i.e., the wash 
liquor) to the weight of fabric, employed herein is generally 
an amount sufficient to achieve the desired stonewashing 
effect in the denim fabric and is dependent upon the process 
used. Preferably, the liquor ratios are from about 4:1 to 
about 50:1; more preferably from 5:1 to about 20:1, and most 
preferably from about 10:1 to about 15:1. 

Reaction temperatures during stonewashing with the 
present stonewashing compositions are governed by two 
competing factors. Firstly, higher temperatures generally 
correspond to enhanced reaction kinetics, i.e., faster 
reactions, which permit reduced reaction times as compared to 
reaction times required at lower temperatures. Accordingly, 
reaction temperatures are generally at least about 10 °C and 
greater. Secondly, cellulase is a protein which loses 
activity beyond a given reaction temperature which temperature 
is dependent on the nature of the cellulase used. Thus, if 
the reaction temperature is permitted to go too high, then the 
cellulolytic activity is lost as a result of the denaturing of 
the cellulase. As a result, the maximum reaction temperatures 
employed herein are generally about 65°C. In view of the 
above, reaction temperatures are generally from about 30°C to 
about 65°C; preferably, from about 35°C to about 60°C; and 
more preferably, from about 35°C to about 55°C. 

Reaction times are dependent on the specific conditions 
under which the stonewashing occurs. For example, pH, 
temperature and concentration of truncated cellulase will all 
effect the optimal reaction time. Generally, reaction times 
are from about 5 minutes to about 5 hours, and preferably from 
about 10 minutes to about 3 hours and, more preferably, from 
about 20 minutes to about 1 hour. 

Cellulose containing fabrics treated in the stonewashing 
methods described above using truncated cellulase compositions 
according to the present invention show reduced redeposition 
of dye as compared to the same cellulose containing fabrics 
treated in the same manner with an non-truncated cellulase 
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composition. 



^ Methodology for Tr»» r j na c «ii»i 

'1th A np^ppent Cni^pn 





According to the present invention, the truncated 
cellulase compositons described above may be employed as 
detergent composition. The detergent compositions according 
to the present invention are useful as pre-wash compositions, 
pre-soak compositions, or for detergent cleaning during the 
regular wash cycle. Preferably, the detergent composition of 
the present invention comprises an effective amount of 
truncated cellulase, and a surfactant, and optionally include 
other ingredients described below. 

An effective amount of truncated cellulase employed in 
the detergent compositions of this invention is an amount 
sufficient to impart improved abrasion to cellulase containing 
fabrics. Preferably, the truncated cellulase employed is in a 
concentration of about 0.001% to about 25%, more preferably 
about 0.02% to about io% by weight percent of detergent. 

The specific concentration of truncated cellulase enzyme 
employed in the detergent composition is preferably selected 
so that upon dilution into a wash medium, the concentration of 
truncated cellulase enzyme is in a range of about o.l to about 
1000 ppm, preferably from about 0.2 ppm to about 500 ppm, and 
most preferably from about 0.5 ppm to about 250 ppm total 
protein. Thus, the specific amount of truncated cellulase 
enzyme employed in the detergent composition will depend on 
the extent to which the detergent will be diluted upon 
addition to water so as to form a wash solution. 

At lower concentrations of truncated cellulase enzyme, 
i.e., concentrations of truncated enzyme lower than 20 ppm,' 
the decreased backstaining or redeposition with equivalent' 
surface fiber abrasion when compared to prior art compositions 
will become evident after repeated washings. At higher 
concentrations, i.e., concentrations of truncated cellulase 
enzymes of greater than 40 ppm, the decreased backstaining 
with equivalent surface fiber removal will become evident 
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after a single wash. 

The detergent compositions of the present invention may 
be in any art recognized form, for example , as a liquid 
diluent, in granules, in emulsions, in gels, or in pastes. 
5 Such forms are well known to the skilled artisan. When a solid 
detergent composition is employed, the truncated cellulase is 
preferably formulated as granules. Preferably, the granules 
can be formulated so as to additionally contain a cellulase 
protecting agent. See, for instance, U.S. Serial No. 

10 07/642,669 filed January 17, 1991 as Attorney Docket No. 

010055-073 and entitled H GRANULES CONTAINING BOTH AN ENZYME 
AND AN ENZYME PROTECTING AGENT AND DETERGENT COMPOSITIONS 
CONTAINING SUCH GRANULES" which application is incorporated 
herein by reference in its entirety. Likewise, the granule 

15 can be formulated so as to contain materials to reduce the 

rate of dissolution of the granule into the wash medium. Such 
materials and granules are disclosed in U.S. Patent No. 
5,254,283 which is incorporated herein by reference in its 
entirety. 

20 The detergent compositions of this invention employ a 

surface active agent, i.e., surfactant, including anionic, 
non-ionic and ampholytic surfactants well known for their use 
in detergent compositions. 

Suitable anionic surfactants for use in the detergent 

25 composition of this invention include linear or branched 

alkylbenzenesulfonates; alkyl or alkenyl ether sulfates having 
linear or branched alkyl groups or alkenyl groups; alkyl or 
alkenyl sulfates; olef insulf onates; alkanesul-f onates and the 
like. Suitable counter ions for anionic surfactants include 

30 alkali metal ions such as sodium and potassium; alkaline earth 
metal ions such as calcium and magnesium; ammonium ion; and 
alkanolamines having 1 to 3 alkanol groups of carbon number 2 
or 3. 

Ampholytic surfactants include quaternary ammonium salt 
35 sulfonates, betaine-type ampholytic surfactants, and the like. 
Such ampholytic surfactants have both the positive and 
negative charged groups in the same molecule. 

Nonionic surfactants generally comprise polyoxyal-kylene 
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ethers, as well as higher fatty acid alkanolamides or alkylene 
oxide adduct thereof, fatty acid glycerine monoesters, and the 
like. 

Suitable surfactants for use in this invention are 
5 disclosed in British Patent Application No. 2 094 826 A, the 
disclosure of which is incorporated herein by reference. 

Mixtures of such surfactants can also be used. 

The surfactant or a mixture of surfactants is generally 
employed in the detergent compositions of this invention in an 
10 amount from about 1 weight percent to about 95 weight percent 
of the total detergent composition and preferably from about 5 
weight percent to about 45 weight percent of the total 
detergent composition. Upon dilution in the wash medium, the 
surfactant concentration is generally about 500 ppm or more; 
15 and preferably, from about 1000 ppm to 15 , 000 ppm. 

In addition to the cellulase composition and the 
surfactant ( s) , the detergent compositions of this invention 
can optionally contain one or more of the following 
components : 

20 

Hydrolases Excep t Cellulase 

Such hydrolases include carboxylate ester hydrolase, 
thioester hydrolase, phosphate monoester hydrolase, and 
phosphate diester hydrolase which act on the ester bond; 

25 glycoside hydrolase which acts on glycosyl compounds; an 
enzyme that hydrolyzes N-glycosyl compounds; thioether 
hydrolase which acts on the ether bond; and a-amino-acyl- 
peptide hydrolase, peptidyl-amino acid hydrolase, acyl-amino 
acid hydrolase, dipeptide hydrolase, and peptidyl-peptide 

30 hydrolase which act on the peptide bond. Preferable among 

them are carboxylate ester hydrolase, glycoside hydrolase, and 
peptidyl-peptide hydrolase. Suitable hydrolases include (1) 
proteases belonging to peptidyl-peptide hydrolase such as 
pepsin, pepsin B, rennin, trypsin, chyrootrypsin A, 

35 chymotrypsin B, elastase, enterokinase, cathepsin C, papain, 

chymopapain, ficin, thrombin, f ibrinolysin, renin, subtilisin, 
aspergillopeptidase A, collagenase, clostridiopeptidase B, 
kallikrein, gastrisin, cathepsin D. ; bromelin, keratinase, 
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chymotrypsin C, pepsin C, asp erg i 1 1 opept i da se B , urokinase, 
carboxypeptidase A and B, and aminopeptidase; (2) glycoside 
hydrolases (cellulase which is an essential ingredient is 
excluded from this group) cr-amylase, B-amylase, gluco amylase, 
5 invertase, lysozyme # pectinase, chitinase, and dextranase. 
Preferably among them are o-amylase and B-amylase. They 
function in acid to neutral systems, but one which is obtained 
from bacteria exhibits high activity in an alkaline system; 
(3) car boxy late ester hydrolase including carboxyl esterase, 

10 lipase, pectin esterase, and chlorophyllase. Especially 
effective among them is lipase. 

Trade names of commercial products and producers are as 
follows: "Alkalase", "Esperase", "Savinase", "AMG", "BAN", 
"Fungamill", "Sweetzyme", "Thermamyl" (Novo Industry, 

15 Copenhagen, Denmark); "Maksatase", "High-alkaline protease", 

"Amylase THC", "Lipase" (Gist Brocades, N.V. , rielft, Holland); 
"Protease B-400", "Protease B-4000", "Protease AP", "Protease 
AP 2100" (Schweizerische Ferment A.G., Basel, Switzerland); 
"CRD Protease" (Monsanto Company, St, Louis, Missouri); 

20 "Piocase" (Piopin Corporation, Monticello, Illinois); "Pronase 
P", "Pronase AS", "Pronase AF" (Kaken Chemical Co, , Ltd., 
Japan); "Lapidase P-2000" (Lapidas, Secran, France); protease 
products (Tyler standard sieve, 100% pass 16 mesh and 100% on 
150 mesh) (Clington Corn Products, Division of Standard Brands 

25 Corp., New York); "Takamine", "Bromelain 1:10", "HT Protease 
200", "Enzyme L-W" (obtained from fungi, not from bacteria) 
(Miles Chemical Company, Elkhart, Ind.); "Rhozyme P-ll Cone", 
"Pectinol" , "Lipase B", "Rhozyme PF", "Rhozyme J-2 5" (Rohm & 
Haas, Genencor, South San Francisco, CA) ; "Ambrozyme 200" 

30 (Jack Wolf & Co., Ltd., Subsidiary of Nopco Chemical Company, 
Newark, N.J.); "ATP 40", "ATP 120", "ATP 160" (Lapidas, 
Secran, France); "Oripase" (Nagase & Co., Ltd., Japan). 

The hydrolase other than cellulase is incorporated into 
the detergent composition as much as required according to the 

35 purpose. It should preferably be incorporated in an amount of 
0.001 to 5 weight percent, and more preferably 0.02 to 3 
weight percent, in terms of purified one. This enzyme should 
be used in the form of granules made of crude enzyme alone or 
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in combination with other components in the detergent 
composition. Granules of crude enzyme are used in such an 
amount that the purified enzyme is 0.001 to 50 weight percent 
in the granules. The granules are used in an amount of 0.002 
5 to 20 and preferably 0.1 to 10 weight percent. As with 

cellulases, these granules can be formulated so as to contain 
an enzyme protecting agent and a dissolution retardant 
material. 

10 Cationic Surfactants and Long-Chain Fat tv Acid Salts 

Such cationic surfactants and long-chain fatty acid salts 
include saturated or unsaturated fatty acid salts, alkyl or 
alkenyl ether carboxylic acid salts, a-sulfofatty acid salts 
or esters, amino acid-type surfactants, phosphate ester 

15 surfactants, quaternary ammonium salts including those having 
3 to 4 alkyl substituents and up to 1 phenyl substituted alkyl 
substituents. Suitable cationic surfactants and long-chain 
fatty acid salts are disclosed in British Patent Application 
No. 2 094 826 A, the disclosure of which is incorporated 

20 herein by reference. The composition may contain from about 1 
to about 20 weight percent of such cationic surfactants and 
long-chain fatty acid salts. 

25 A. Divalent sequestering agents. 

The composition may contain from about 0 to about 50 
weight percent of one or more builder components selected from 
the group consisting of alkali metal salts and alkanolaroine 
salts of the following compounds: phosphates, phosphonates, 

30 phosphonocarboxylates, salts of amino acids, aminopolyacetates 
high molecular electrolytes, non-dissociating polymers, salts 
of dicarboxylic acids, and aluminosilicate salts. Suitable 
divalent sequestering gents are disclosed in British Patent 
Application No. 2 094 826 A, the disclosure of which is 

3 5 incorporated herein by reference. 

B, Alkalis or inorga nic electrolytes 

The composition may contain from about 1 to about 50 
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weight percent, preferably from about 5 to about 30 weight 
percent, based on the composition of one or more alkali metal 
salts of the following compounds as the alkalis or inorganic 
electrolytes: silicates, carbonates and sulfates as well as 
organic alkalis such as triethanolamine, diethanolamine, 
monoethanolamine and triisopropanolamine. 

Antirede position Agents 

The composition may contain from about 0-1 to about 5 
weight percent of one or more of the following compounds as 
antiredeposition agents: polyethylene glycol, polyvinyl 
alcohol, polyvinylpyrrolidone and carboxymethylcellulose. 

Among them, a combination of carboxymethyl-cellulose 
and/or polyethylene glycol with the cellulase composition of 
the present invention provides for an especially useful dirt 
removing composition. 

pitching Agents 

The use of the cellulase of the present invention in 
combination with a bleaching agent such as sodium 
percarbonate, sodium perborate, sodium sulf ate/hydrogen 
peroxide adduct and sodium chloride/hydrogen peroxide adduct 
or /and a photo-sensitive bleaching dye such as zinc or 
aluminum salt of sulfonated phthalocyanine further improves 
the detergenting effects. 

Bluing ftg SDtS and Fluorescent Dves 

Various bluing agents and fluorescent dyes may be 
incorporated in the composition, if necessary. Suitable 
bluing agents and fluorescent dyes are disclosed in British 
Patent Application No. 2 094 826 A, the disclosure of which is 
incorporated herein by reference. 

caking In hibitors 

The following caking inhibitors may be incorporated in 
the powdery detergent: p-toluenesulf onic acid salts, 
xylenesulfonic acid salts, acetic acid salts, sulf osuccinic 
acid salts, talc, finely pulverized silica, clay, calcium 
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silicate (such as Micro-Cell of Johns Manville Co.), calcium 
carbonate and magnesium oxide* 

Masking Agents for Factors Inhibiting the Cellulase Activity 

The cellulase composition of this invention are 
deactivated in some cases in the presence of copper, zinc, 
chromium, mercury, lead, manganese or silver ions or their 
compounds. Various metal chelating agents and metal- 
precipitating agents are effective against these inhibitors. 
They include, for example, divalent metal ion sequestering 
agents as listed in the above item with reference to optional 
additives as well as magnesium silicate and magnesium sulfate. 

Cellobiose, glucose and gluconolactone act sometimes as 
the inhibitors. It is preferred to avoid the co-presence of 
these saccharides with the cellulase as far as possible. In 
case the co-presence in unavoidable, it is necessary to avoid 
the direct contact of the saccharides with the cellulase by, 
for example, coating them. 

Long-chain-fatty acid salts and cationic surfactants act 
as the inhibitors in some cases. However, the co-presence of 
these substances with the cellulase is allowable if the direct 
contact of them is prevented by some means such as tableting 
or coating. 

The above-mentioned masking agents and methods may be 
employed, if necessary, in the present invention. 

Cellulase-Activators 

The activators vary depending on variety of the 
cellulases. In the presence of proteins, cobalt and its 
salts, magnesium and its salts, and calcium and its salts, 
potassium and its salts, sodium and its salts or monosac- 
charides such as mannose and xylose, the cellulases are 
activated and their deterging powers are improved remarkably. 

Antioxidants 

The antioxidants include, for example, tert-butyl- 
hydroxytoluene, 4 , 4 ' -butyl idenebis (6-tert-butyl-3- 
roethylphenol) , 2,2' -butylidenebis (6-tert-butyl-4- 
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thylphenol), monostyrenated cresol, distyrenated cresol 
monostyrenated phenol, distyrenated phenol and l,l-bis(4- 
hydroxy-pheny 1 ) cyclohexane . 



The solubilizers include, for example, lower alcohols 
such as ethanol, benzenesulfonate salts, lower 
alkylbenzenesulfonate salts such as p-toluenesulfonate salts, 
glycols such as propylene glycol, acetylbenzene-sulf onate 
salts, acetamides, pyridinedicarboxylic acid amides, benzoate 

salts and urea. 

The detergent composition of the present invention can be 
used in a broad pH range of from acidic to alkaline pH. In a 
preferred embodiment, the detergent composition of the present 
invention can be used in alkaline detergent wash media and 
more preferably, alkaline detergent wash media having a pH of 
from above 7 to no more than about 11. 

Aside from the above ingredients, perfumes, buffers, 
preservatives, dyes and the like can be used, if desired, with 
the detergent compositions of this invention. Such components 
are conventionally employed in amounts heretofore used in the 



When a detergent base used in the present invention is in 
the form of a powder, it may be one which is prepared by any 

5 known preparation methods including a spray-drying method and 
a granulation method. The detergent base obtained 
particularly by the spray-drying method and/ or spray-drying 
granulation method are preferred. The detergent base obtained 
by the spray-drying method is not restricted with respect to 

0 preparation conditions. The detergent base obtained by the 
spray-drying method is hollow granules which are obtained by 
spraying an aqueous slurry of heat-resistant ingredients, such 
as surface active agents and builders, into a hot space. The 
granules have a size of from 50 to 2000 micrometers. After 

5 the spray-drying, perfumes, enzymes, bleaching agents, 
inorganic alkaline builders may be added. With a highly 
dense, granular detergent base obtained such as by the spray- 
drying-granulation method, various ingredients may also be 
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added after the preparation of the base. 

When the detergent base is a liquid, it may be either a 
homogeneous solution or an inhomogeneous dispersion. For 
removing the decomposition of carboxymethylcellulose by the 
5 cellulase in the detergent, it is desirable that 

carboxymethylcellulose is granulated or coated before the 
incorporation in the composition. 

The detergent compositions of this invention may be 
incubated with cellulose containing fabric, for example soiled 
10 fabrics, in industrial and household uses at temperatures, 
reaction times and liquor ratios conventionally employed in 
these environments. The incubation conditions, i.e., the 
conditions effective for treating cellulose containing fabrics 
with detergent compositions according to the present 
15 invention, will be readily ascertainable by those of skill in 
the art. Accordingly, the appropriate conditions effective 
for treatment with the present detergents will correspond to 
those using similar detergent compositions which include wild- 
type cellulase. 

20 As indicated above, detergents according to the present 

invention may additionally be formulated as a pre-wash in the 
appropriate solution at an intermediate pH where sufficient 
activity exists to provide desired improvements in abrasion 
and reduced strength loss. When the detergent composition is 

25 a pre-soak (e.g., pre-wash or pre-treatment) composition, 
either as a liquid, spray, gel or paste composition, the 
truncated cellulase enzyme is generally employed from about 
0.0001 to about 1 weight percent based on the total weight of 
the pre-soak or pre-treatment composition. In such 

30 compositions, a surfactant may optionally be employed and when 
employed, is generally present at a concentration of from 
about 0.005 to about 20 weight percent based on the total 
weight of the pre-soak. The remainder of the composition 
comprises conventional components used in the pre-soak, i.e., 

35 diluent, buffers, other enzymes (proteases) , and the like at 
their conventional concentrations. 

Also, it is contemplated that compositions comprising 
truncated cellulase enzymes described herein can be used in 
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home use as a stand alone composition suitable- for restoring 
color to faded fabrics (see, for example, U.S. Patent No. 
4,738,682, which is incorporated herein by reference in its 
entirety) as well as used in a spot-remover. 

In order to further illustrate the present invention and 
advantages thereof, the following specific examples are given 
with the understanding that they are being offered to 
illustrate the present invention and should not be construed 
in any way as limiting its scope ♦ 




Example l 

15 Cloning and Expression of EG1 Core Domain Using 

its Own Promoter, Terminator and Signal Sequence. 

Ear£ i- cloning 

The complete eall gene used in the construction of the 

20 EG1 core domain expression plasmid, PEGlA3'pyr, was obtained 
from the plasmid PUC218::EG1. (See Fig. 6) The 3' terminator 
region of eall was ligated into PUC218 (Korman, D. et al Curr 
Genet 17:203-212, 1990) as a 300 bp Bsml-EcoRI fragment along 
with a synthetic linker designed to replace the 3' intron and 

25 cellulose binding domain with a stop codon and continue with 
the eall terminator sequences. The resultant plasmid, PEG IT, 
was digested with Hind lll and fiasl and the vector fragment was 
isolated from the digest by agarose gel electrophoresis 
followed by electroelution. The eall gene promoter sequence 

30 and core domain of eall were isolated from PUC218:;EG1 as a 
2.3kb Hind lll- gst I fragment and ligated with the same 
synthetic linker fragment and the Hind lll-BsmI digested PEG IT 

to form PEG1A3 ' 

The net result of these operations is to replace the 3' 
35 intron and cellulose binding domain of egll with synthetic 

oligonucleotides of 53 and 55bp. These place a TAG stop codon 
after serine 415 and thereafter continued with the egll 
terminator up to the fisnl site. 

Next, the T. lonaibrachiatum selectable marker, pvr4 . was 
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obtained from a previous clone p2l9M (Smith et al 1991) , as an 
isolated l.6kb EcoRI- Hind lll fragment. This was incorporated 
into the final expression plasmid, PEGlA3'pyr, in a three way 
ligation with PUC18 plasmid digested with EcoR l and 
5 dephosphorylated using calf alkaline phosphatase and a 

Hind lll-EcoRI fragment containing the eall core domain from 
PEG1A3 9 • 

Part 2 1 Transfprmatipn and Expression 

10 A large scale DNA prep was made of PEGlA3'pyr and from 

this the EcoR I fragment containing the eall core domain and 
ovr4 gene was isolated by preparative gel electrophoresis. 
The isolated fragment was transformed into the uridine 
auxotroph version of the quad deleted strain, 1A52 pyrl3 

15 (described in U.S. Patent Application Serial Nos. 07/770,049, 
08/048,728 and 08/048,881, incorporated by reference in its 
entirety herein), and stable transf ormants were identified. 

To select which transf ormants expressed eall core domain 
the transf ormants were grown up in shake flasks under 

20 conditions that favored induction of the cellulase genes 

(Vogels + 1% lactose). After 4-5 days of growth, protein from 
the supernatants was concentrated and either 1) run on SDS 
polyacrylamide gels prior to detection of the eall core domain 
by Western analysis using EGI polyclonal antibodies or 2) the 

25 concentrated supernatants were assayed directly using RBB 
carboxy methyl cellulose as an endoglucanase specific 
substrate and the results compared to the parental strain 1A52 
as a control. Transf ormant candidates were identified as 
possibly producing a truncated EGI core domain protein. 

30 Genomic DNA and total mRNA was isolated from these strains 
following growth on Vogels + 1% lactose and Southern and 
Northern blot experiments performed using an isolated DNA 
fragment containing only the eall core domain. These 
experiments demonstrated that transf ormants could be isolated 

35 having a copy of the eall core domain expression cassette 
integrated into the genome of 1A52 and that these same 
transf ormants produced eall core domain mRNA. 

One transformant was then grown using media suitable for 
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cellulase production in Trichoderma well known tn the art that 
was supplemented with lactose (Warzymoda, M. et al 1984 French 
Patent No. 2555603) in a 14L ferraentor. The resultant broth 
was concentrated and the proteins contained therein were 
5 separated by SDS polyacrylamide gel electrophoresis and the 

Ea 11 core domain protein identified by Western analysis. (See 
Example 3 below) . It was subsequently estimated that the 
protein concentration of the fermentation supernatant was 
about 5-6 g/L of which approximately 1.7-4.4g/L was EGI core 
10 domain based on CMCase activity. This value is based on an 

average of several EGI core fermentations that were performed. 

In a similar manner, any other cellulase domain or 
derivative thereof may be produced by procedures similar to 
those discussed above. 

15 

Example 2 

Purification of EGI and EGII catalytic cores 

Part it ESI catalytic cor? 

20 The EGI core was purified in the following manner. The 

concentrated (UF) broth was diluted to 14 rog/ml in 23 mN Na 
Acetate pH 5.0. Two hundred grams of avicel cellulose gel 
(FMC Bioproducts, Type PH-101) was added to the diluted EGI 
core broth and mixed at room temperature for forty five 

25 minutes. The avicel was removed from the broth by 

centrif ugation, resulting in an enriched EGI core solution. 
This solution was then buffer exchanged into lOmM TES pH 7.5 
using a Amicon stirred cell concentrator with a PM 10 membrane 
(diaflo ultra filtration membranes, Amicon Cat # 13132MEM 

30 5468A) . The EGI core sample was then loaded onto an anion 
exchange column (Q-sepharose fast flow, Pharmacia Cat t 17- 
0510-01) and eluted in a salt gradient from 0 to 0.5M NaCl in 
10 mM TES pH 7.5. The fractions which contained the EGI core 
were combined and concentrated using the Amicon stirred cell 

35 concentrator mentioned above. 

Part 2. EGIT catalytic core 
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EGII core was purified in the following manner. The 
concentrated (UF) broth was filtered using diatomaceous earth. 
Ammonium sulfate was added to the broth to a final 
concentration of 1M (NH 4 ) 2 SO i and this mixture was then loaded 
on to a hydrophobic column (phenyl-sepharose fast flow, 
Pharmacia, cat #17-0965-01). The column was washed with 0.75M 
ammonium sulfate before elution of the EGII core with 0.5M 
ammonium sulfate. The fractions containing the core were 
pooled and concentrated using a tangential flow ultra 
filtration membrane (Filtron Minisette Ultrafiltration System, 
cat #FS018K01) with an Omega 10K Membrane (Filtron, cat* 
FS010K75) . 100 grams of avicel cellulose gel (FMC 
Bioproducts, Type PH-101) was added for every gram of 
concentrated EGII core and mixed for 40 minutes. The avicel 
was removed by centrif ugation resulting in an enriched EGII 
core solution. 

Example 3 

Cloning and Expression of CBHII Core Domain Using the 
CBHI Promoter , Terminator a nd Signal Sequence from CBHII. 

Pgrt *- Construction of the T. longi brachiatum general- 

purpose expression plasmid-PTEX 

The plasmid, PTEX was constructed following the methods 

of Sambrook et al. (1989), supra, and is illustrated in Fig. 

7, This plasmid has been designed as a multi-purpose 

expression vector for use in the filamentous fungus 

Trichoderma lonoibrachiatum. The expression cassette has 

several unique features that make it useful for this function. 

Transcription is regulated using the strong CBH I gene 

promoter and terminator sequences for T . 1 ona i br a ch i a turn . 

Between the CBHI promoter and terminator there are unique Pme l 

and Sst I restriction sites that are used to insert the gene to 

be expressed. The T. lonaibrachiatum pyr4 selectable marker 

gene has been inserted into the CBHI terminator and the whole 

expression cassette (CBHI promoter-insertion sites-CBHI 

terminator-Eyr4 gene-CBHI terminator) can be excised utilizing 

the unique NotI restriction site or the unique Not I and Nhel 
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restriction sites. 

This vector is based on the bacterial vector, pSL1180 
(Pharmacia Inc., Piscataway, New Jersey), which is a PUC-type 
vector with an extended multiple cloning site. One skilled in 
the art would be able to construct this vector based on the 
flow diagram illustrated in FIG 7. (See also U.S. patent 
application 07/954,113 filed September 30, 1992 entitled 
"Stonewashing of Denim Garments Using Endoglucanase I & III" 
for the construction of PTEX expression plasmid.) 

It would be possible to construct plasmids similar to 
PTEX-truncated cellulases or derivatives thereof described in 
the present invention containing any other piece of DNA 
sequence replacing the truncated cellulase gene. 

Part 2. Cloning 

The complete cbh2 gene used in the construction of the 
CBHII core domain expression plasmid, PTEX CBHII core, was 
obtained from the plasmid PUC219: :CBHII (Korman, D. et al, 
1990, Curr Genet 17:203-212). The cellulose binding domain, 
positioned at the 5' end of the cbh2 gene, is conveniently 
located between an Xba l and SnaB I restriction sites. In order 
to utilize the Xfeal site an additional Xbal site in the 
polylinker was destroyed. PUC219: :CBHII was partially 
digested with Xba l such that the majority of the product was 
linear. The yba l overhangs were filled in using T4 DNA 
polymerase and ligated together under conditions favoring self 
ligation of the plasmid. This has the effect of destroying 
the blunted site which, in 50% of the plasmids, was the Xbal 
site in the polylinker. Such a plasmid was identified and 
digested with Xba l and SnaB I to release the cellulose binding 
domain. The vector-CBHII core domain was isolated and ligated 
with the following synthetic oligonucleotides designed to join 
the Xba l site with the SnaB I site at the signal peptidase 
cleavage site and papain cleavage point in the linker domain. 

Xba l SBaB 1 
5' CTA GAG CCG TCG GGA ACC GCT AC 3' (SEQ ID NO: 42) 

3' TC CTC GCC AGC CCT TGG CGA TG 5' 

Leu Glu Glu Arg Ser Gly Thr Ala Thr (SEQ ID NO: 43) 

- 52 - 



WO 96/23928 PCTAJS96/00977 



The resultant plasmid, pUCACBD CBHII, was digested with 
Hh&I and the ends blunted by incubation with T4 DNA polymerase 
and dNTPs. After which the linear blunted plasmid DNA was 
digested with fialll and the Nhe (blunt) Bal ll fragment 
5 containing the CBHII signal sequence and core domain was 
isolated. 

The final expression plasmid was engineered by digesting 
the general purpose expression plasmid, pTEX (disclosed in 
07/954 , 113 , incorporated in its entirety by references, and 

10 described in Part 3 below) , with £jg£II and Pmel and ligating 
the CBHII jahfil (blunt) -fialU fragment from pUCACBD CBHII 
downstream of the cbhl promoter using a synthetic 
oligonucleotide having the sequence CGCTAG to fill in the 
fialll overhang with the £s£II overhang. The construction of 

15 the resulting pTEC CBHIIcore expression vector is summarized 
in Fig. 9. 

The pTEX-CBHI core expression plasmid was prepared in a 
similar manner as pTEX-CBHII core described in the above 
example. Its construction is exemplified in Fig. 8. 

20 

Part ?« Transformation and Expression 

A large scale DNA prep was made of pTEX CBHIIcore and 
from this the NotI fragment containing the CBHII core domain 
under the control of the shhl transcriptional elements and 

25 pyr4 gene was isolated by preparative gel electrophoresis. 
The isolated fragment was transformed into the uridine 
auxotroph version of the quad deleted strain, 1A52 pyrl3, and 
stable transf ormants were identified. 

To select which transf ormants expressed cbh2 core domain 

30 genomic DNA was isolated from strains following growth on 

Vogels + 1% glucose and Southern blot experiments performed 
using an isolated DNA fragment containing only the cbh2 core 
domain. Transf ormants were isolated having a copy of the cbh2 
core domain expression cassette integrated into the genome of 

35 1A52 . Total mRNA was isolated from the two strains following 
growth for 1 day on Vogels + 1% lactose. The mRNA was 
subjected to Northern analysis using the cbh2 coding region as 
a probe. Transf ormants expressing cbh2 core domain mRNA were 
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identified. 

Two trans formants were grown under the same conditions as 
previously described in Example 1 in 14L fermentors. The 
resultant broth was concentrated and the proteins contained 
5 therein were separated by SDS polyacrylamide gel 

electrophoresis and the CBHII core domain protein identified 
by Western analysis. One transf ormant, #15, produced a 
protein of the correct size and reactivity to CBHII polyclonal 
antibodies. 

10 It was subsequently estimated that the protein 

concentration of the fermentation supernatant after 
purification was lOg/L of which 30-50% was CBHII core domain 

■ 

(See Example 4) . 

One may obtain any other novel truncated cellulase core 
15 domain protein or derivative thereof by employing the methods 
described above. 

Example 4 

Purification of cpHJ *?id WHJl ca^lytic ports 

20 

Part l. CBHI catalytic core 

The CBHI core was purified from broth in the following 
manner. The CBHI core ultraf iltered (UF) broth was filtered 
using diatomaceous earth and diluted in 10 mM TES pH 6.8 to a 

25 conductivity of 1.5 mOhm. The diluted CBHI core was then 

loaded onto an anion exchange column (Q-Sepharose fast flow, 
Pharmacia cat # 17-0510-01) equilibrated in 10 mM TES pH 6.8. 
The CBHI core was separated from the majority of the other 
proteins in the broth using a gradient elution in 10 mM TES pH 

30 6.8 from 0 to 1M NaCl . The fractions containing the CBHI core 
were then concentrated on an Amicon stirred cell concentrator 
with a PM 10 membrane (diaflo ultra filtration membranes , 
Amicon Cat # 13132MEM 5468A) . This step concentrated the 
core, and additionally effected a separation of the core from 

35 lower molecular weight proteins. The resulting fractions were 
greater than 85% pure CBHI core. The purest fraction was 
sequence verified to be the CBHI core. 
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Part: 2. CBHII catalytic core 

It is predicted that CBHII catalytic core will purify in 
a manner similar to that of CBHII cellulase because of its 
similar biochemical properties. The theoretical pi of the 
5 CBHII core is less than half a pH unit lower than that of 
CBHII. Additionally, CBHII catalytic core is approximately 
80% of the molecular weight of CBHII. Therefore, the 
following proposed purification protocol is based on the 
purification method used for CBHII. The diatomaceous earth 

10 treated, ultra filtered (UF) CBHII core broth is diluted into 
10 mM TES pH 6.8 to a conductivity of <0.7 mOhm. The diluted 
CBHII core is then loaded onto an anion exchange column (Q- 
Sepharose fast flow, Pharmacia, cat # 17 0510-01) equilibrated 
in 10 mM TES pH 6.8. A salt gradient from 0 to 1M NaCl in 10 

15 mM TES pH 6.8 is used to elute the CBHII core off the column. 
The fractions which contain the CBHII core is then buffer 
exchanged into 2mM sodium succinate buffer and loaded onto a 
cation exchange column (SP-sephadex C-50) . The CBHII core is 
next eluted from the column with a salt gradient from 0 to 

20 100mM NaCl. 

Example 5 

ptonewaptgng Wlt)i Truncated CeUvlase En?YP« 

25 This example demonstrates that the use of truncated 

cellulase catalytic cores, and in particular truncated 
endoglucanase catalytic cores, surprisingly results in a 
reduced redeposition of dye onto a cellulose containing 
(denim) fabric during stonewashing at an equivalent level of 

30 abrasion to non-truncated cellulase. Further, the combination 
of truncated CBH-type catalytic core and non-truncated EG- type 
cellulase resulted in significantly improved backstaining 
characteristics when compared to non-truncated combinations at 
equivalent levels of abrasion. Denim is cotton cloth which 

35 has been dyed. Methods for imparting a stone washed 

appearance to denim fabrics are described in U.S Patent No. 
4,832,864 and U.S. Serial No. 07/954,113 which are 
incorporated herein by reference in its entirety. 
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Truncated cellulase catalytic cores and cellulase 
components were purified for use in this example as follows: 

a) EGIII cellulase component was purified according to 
the method described in U.S. Patent No. 5,328,841 which 

5 disclosure is incorporated herein by reference in its 
entirety. 

b) truncated EG I catalytic cellulase core was purified by 
the method of Example 2. 

c) truncated EGII catalytic cellulase core was purified 
10 by the method of Example 2. 

d) truncated CBHI catalytic core was prepared according 
to the method of Example 4. 

e) EGI cellulase component was purified for use in 
stonewashing applications in the following manner. The 

15 concentrated (UF) broth was diluted in 25 mM sodium acetate 
buffer pH 5.0 to a concentration of 7 mg/ml protein. 450 
grams of avicel cellulose gel (FMC Bioproducts, Type PH-101) 
was added to the diluted broth and mixed at room temperature 
for thirty minutes. The avicel was removed from the broth by 

20 centrifugation resulting in an enriched EGI catalytic core. 

f) EGII cellulase component was purified for use in 
stonewashing in the following manner. The concentrated (UF) 
broth was filtered through diatomaceous earth and buffer 
exchanged into citrate-phosphate pH 7, 0.4 mOhm using an 

25 ultrafiltration unit (Filtron Minisette Ultrafiltration 

Syustem, cat # FS018K01) with an Omega 10K Membrane (Filtron, 
cat. # FS010K75) . This material was then loaded onto an anion 
exchange column (Q-sepharose fast flow, Pharmacia, cat. #17- 
0510-01) equilibrated in the above citrate-phosphate buffer. 

30 EGII was collected in the column flow through and fubber 

exchanged into sodium citrate pH 3.25, 0.5 mOhms using the 
ultrafiltration unit mentioned above. This material was then 
loaded onto a cation exchange column (SP-Spherodex LS, IBF 
Biotechnics, cat #262041). EGII was then eluted off the 

35 column in a linear gradient from pH 3.25 to 5.4. The 
fractions containing EGII were pooled and then used in 
stonewashing applications. 

g) CBH I cellulase component was purified according to 



56 



WO 96/23928 PCT/US96/009T7 



the method described in Example 15 of U.S. patent application 
serial no. 07/954,113. 

These truncated cellulase catalytic core domains and 
cellulase components were tested alone or in combination for 
5 their ability to impart a stonevashed appearance to dyed 
denims. Specif ically, the stonevashed denim fabrics were 
prepared using an industrial washer and dryer under the 
following conditions: 

Citrate/phosphate buffer § pH 5 
10 38L total volume 

120-135° F 

Six pair of denim pants with an equal weight of ballast 
1 hour run time 

15-30 ppm EGI , EGII , EG11I; 50 ppm CBH I and CBHI 
15 catalytic core; and 30-70 ppm EGI catalytic core and EGII 
catalytic core. 

A detergent post wash included 90 grams of AATCC 
detergent without brighteners in the case of CBHI, CBHI 
catalytic core, EGIII plus CBHI and EGIII plus CBHI catalytic 
20 core. 

The fabrics were evaluated for their stonewashed 

i 

appearance (abrasion) and also for redeposition of dye onto 
the fabric by five panelists. The fabrics were graded using a 
scale consisting of denim swatches which have incrementally 

25 increasing levels of abrasion or redeposition. The scales are 
numerated from 1-10; 1 representing the jeans with the least 
amount of abrasion or redeposition (desize denim) , and 10 
representing a highly abraded or redeposited denim. The 
appearance of the treated jeans was directly compared to the 

30 denim swatches in the scale and give a number based on the 

swatch in the scale they most closely matched. The results of 
the evaluation are shown in Figs. 10 and 11. 

As shown in Fig. 10, truncated EGI catalytic core and 
truncated EGII catalytic core provide significant advantages 

3 5 over their un-modified, native state counterparts in abrasion 
and reduced redeposition. 

As shown in Fig. 11, the addition of CBHI cellulase 
component to EGIII resulted in a slight enhancement of 
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stonewashing but a large increase in backstaining 
(redeposition) of dye onto the fabric. However, the addition 
of truncated CBHI catalytic core to EGIII solution improved 
results in both increased abrasion and decreased redeposition. 
In fact, the addition of truncated CBHI catalytic core to 
EGIII cellulase decreased redeposition to a level below that 
of EGIII alone* 

As these result show, the use of truncated cellulase core 
during treatment of cellulose containing fabrics surprisingly 
results in decreased backstaining while either maintaining an 
equivalent or superior level of abrasion. Further, this 
result is achieved either when the truncated cellulase is used 
alone, or when it is mixed with another wild-type or truncated 
cellulase. These results are both unexpected and surprising. 



58 



WO 96/23928 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(I) APPLICANT: GENENCOR INTERNATIONAL, INC. 



00 TITLE OF INVENTION: Novel Cellulase Enzymes and Systems 
For Their Expression 

(Hi) NUMBER OF SEQUENCES: 43 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genencor International 

(B) STREET: 180 Kimball Way 

(C) CITY: South San Francisco 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 

(vl) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Christopher L Stone 

(B) REGISTRATION NUMBER: 36,696 

(C) REFERENCE/DOCKET NUMBER: GC226-2-PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 742-7555 

(B) TELEFAX: (415)742-7217 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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Ox) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..96 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

GGC CAG TGC GGC GGT ATT GGC TAC AGC GGC CCC ACG GTC TGC GCC AGC 48 
Gly Gin Cys Gly Gly He Gly Tyr Ser Gly Pro Thr Val Cys Ala Ser 
1 5 10 15 

GGC ACA ACT TGC CAG GTC CTG AAC CCT TAC TAC TCT CAG TGC CTG TA 96 
Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin Cys Leu 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:2: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Pro Thr Val Cys Ala Ser 
1 5 10 15 

Gly Thr Thr Cys Gin Val Leu Asn Pro Tyr Tyr Ser Gin Cys Leu 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:3: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME7KEY: CDS 

(B) LOCATION: join(1..20. 70..166) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAA GCT TGC TCA AGC GTC TG GTAATTATGT GAACCCTCTC AAGAGACCCA 50 
Gin Ala Cys Ser Ser Val Trp 
1 5 
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AATACTGAGA TATGTCAAG G GGC CAA TGT GGT GGC CAG AAT TGG TCG GGT 1 00 

Gly Gin Cys Gly Gly Gin Asn Trp Ser Gly 
10 15 

CCG ACT TGC TGT GCT TCC GGA AGC ACA TGC GTC TAC TCC AAC GAC TAT 148 
Pro Thr Cys Cys Ala Ser Gly Ser Thr Cys Val Tyr Ser Asn Asp Tyr 
20 25 30 

TAC TCC CAG TGT CTT CCC 166 
Tyr Ser Gin Cys Leu Pro 
35 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Gin Ala Cys Ser Ser Val Trp Gly Gin Cys Gly Gly Gin Asn Trp Ser 
1 5 10 15 

Gly Pro Thr Cys Cys Ala Ser Gly Ser Thr Cys Val Tyr Ser Asn Asp 
20 25 30 

Tyr Tyr Ser Gin Cys Leu Pro 
35 

(2) INFORMATION FOR SEQ ID NO:5: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1..82. 140..159) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CAC TGG GGG CAG TGC GGT GGC ATT GGG TAC AGC GGG TGC AAG ACG TGC 48 
His Trp Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Cys Lys Thr Cys 
1 5 10 15 

ACG TCG GGC ACT ACG TGC CAG TAT AGC AAC GAC T GTTCGTATCC 92 
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Thr SerGly ThrThr Cys Gin Tyr Ser Asn Asp 
20 25 

CCATGCCTGA CGGGAGTGAT TTTGAGATGC TAACCGCTAA AATACAG AC TAC TCG 
147 

Tyr Tyr Ser 
30 

CAATGCCTTTA 159 
Gin Cys Leu 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

His Trp Gly Gin Cys Gly Gly lie Gly Tyr Ser Gly Cys Lys Thr Cys 
1 5 10 15 

Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



Ox) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..108 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CAG CAG ACT GTC TGG GGC CAG TGT GGA GGT ATT GGT TGG AGC GGA CCT 48 
Gin Gin Thr Val Trp Gly Gin Cys Gly Gly He Gly Trp Ser Gly Pro 
1 6 10 15 
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ACG AAT TGT GCT CCT GGC TCA GCT TGT TCG ACC CTC AAT CCT TAT TAT 96 
Tfir Asn Cys Ala Pro Gly Ser Ala Cys Ser Thr Leu Asn Pro Tyr Tyr 
20 25 30 

GCG CAA TGT ATT 108 
Ala Gin Cys lie 
35 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Gin Gin Thr Val Trp Gly Gin Cys Gly Gly lie Gly Tip Ser Gly Pro 
1 5 10 15 

Thr Asn Cys Ala Pro Gly Ser Ala Cys Ser Thr Leu Asn Pro Tyr Tyr 
20 25 30 

Ala Gin Cys lie 
35 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1.. 410. 478..1174, 1238.. 1453) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CAG TCG GCC TGC ACT CTC CAA TCG GAG ACT CAC CCG CCT CTG ACA TGG 48 
Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr Trp 
1 5 10 15 

CAG AAA TGC TCG TCT GGT GGC ACT TGC ACT CAA CAG ACA GGC TCC GTG 96 
Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser Val 
20 25 30 

GTC ATC GAC GCC AAC TGG CGC TGG ACT CAC GCT ACG AAC AGC AGC ACG 144 
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Val lie Asp Ala Asn Tip Arg Trp Thr Mis Ala Thr Asn Ser Ser Thr 
35 40 45 

AAC TGC TAC GAT GGC AAC ACT TGG AGC TCG ACC CTA TGT CCT GAC AAC 1 92 
Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn 
50 55 60 

GAG ACC TGC GCG AAG AAC TGC TGT CTG GAC GGT GCC GCC TAC GCG TCC 240 
Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 
65 70 75 SO 

ACG TAC GGA GTT ACC ACG AGC GGT AAC AGC CTC TCC ATT GGC TTT GTC 288 
Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe Val 
85 90 95 

ACC CAG TCT GCG CAG AAG AAC GTT GGC GCT CGC CTT TAC CTT ATG GCG 336 
Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala 
100 105 110 

AGC GAC ACG ACC TAC CAG GAA TTC ACC CTG CTT GGC AAC GAG TTC TCT 384 
Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser 
115 120 125 

TTC GAT GTT GAT GTT TCG CAG CTG CC GTAAGTGACT TACCATGAAC 430 

Phe Asp Val Asp Val Ser Gin Leu Pro 
130 135 

CCCTGACGTA TCTTCTTGTG GGCTCCCAGC TGACTGGCCA ATTTAAG G TGC GGC 
484 

Cys Gly 

TTG AAC GGA GCT CTC TAC TTC GTG TCC ATG GAC GCG GAT GGT GGC GTG 532 
Leu Asn Gly Ala Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val 
140 .145 150 155 

AGC AAG TAT CCC ACC AAC ACC GCT GGC GCC AAG TAC GGC ACG GGG TAC 580 
Ser Lys Tyr Pro Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr 
160 165 170 

TGT GAC AGC CAG TGT CCC CGC GAT CTG AAG TTC ATC AAT GGC CAG GCC 628 
Cys Asp Ser Gin Cys Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala 
175 180 185 



AAC GTT GAG GGC TGG GAG CCG TCA TCC AAC AAC GCA AAC ACG GGC ATT 676 
Asn Val Glu Gly Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He 
190 195 200 

GGA GGA CAC GGA AGC TGC TGC TCT GAG ATG GAT ATC TGG GAG GCC AAC 724 
Gly Gly His Gly Ser Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn 
205 210 215 

TCC ATC TCC GAG GCT CTT ACC CCC CAC CCT TGC ACG ACT GTC GGC CAG 772 
Ser lie Ser Glu Ala Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin 
220 225 230 235 
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GAG ATC TGC GAG GGT GAT GGG TGC GGC GGA ACT TAC TCC GAT AAC AGA 620 
Glu Me Cys Glu Gty Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg 
240 245 250 

TAT GGC GGC ACT TGC GAT CCC GAT GGC TGC GAC TGG AAC CCA TAC CGC 868 
Tyr Gly Gly Thr Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg 
255 260 265 

CTG GGC AAC ACC AGC TTC TAC GGC CCT GGC TCA AGC TTT ACC CTC GAT 916 
Leu Gly Asn Thr Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp 
270 275 280 

ACC ACC AAG AAA TTG ACC GTT GTC ACC CAG TTC GAG ACG TCG GGT GCC 964 
Thr Thr Lys Lys Leu Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala 
285 290 295 



ATC AAC CGA TAC TAT GTC CAG AAT GGC GTC ACT TTC CAG CAG CCC AAC 1 01 2 
lie Asn Arg Tyr Tyr Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn 
300 305 310 315 

GCC GAG CTT GGT AGT TAC TCT GGC AAC GAG CTC AAC GAT GAT TAC TGC 1 060 
Ala Glu Leu Gly Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys 
320 325 330 

ACA GCT GAG GAG GCA GAA TTC GGC GGA TCC TCT TTC TCA GAC AAG GGC 1 1 08 
Thr Ala Glu Glu Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly 
335 340 345 

GGC CTG ACT CAG TTC AAG AAG GCT ACC TCT GGC GGC ATG GTT CTG GTC 1 1 56 
Gly Leu Thr Gin Phe Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val 
350 355 360 

ATG AGT CTG TGG GAT GAT GTG AGTTTG A TGGAC AAAC A TGCGCGTTG A 1 204 

Met Ser Leu Trp Asp Asp 
365 

CAAAGAGTCA AGCAGCTGAC TGAGATGTTA CAG TAC TAC GCC AAC ATG CTG TGG 
1258 

Tyr Tyr Ala Asn Met Leu Trp 
370 375 

CTG GAC TCC ACC TAC CCG ACA AAC GAG ACC TCC TCC ACA CCC GGT GCC 1 306 
Leu Asp Ser Thr Tyr Pro Thr Asn Glu Thr Ser Ser Thr Pro Gly Ala 
380 365 390 



GTG CGC GGA AGC TGC TCC ACC AGC TCC GGT GTC CCT GCT CAG GTC GAA 1354 
Val Arg Gly Ser Cys Ser Thr Ser Ser Gly Val Pro Ala Gin Val Glu 
395 400 405 

TCT CAG TCT CCC AAC GCC AAG GTC ACC TTC TCC AAC ATC AAG TTC GGA 1402 
Ser Gin Ser Pro Asn Ala Lys Val Thr Phe Ser Asn lie Lys Phe Gly 
410 415 420 

CCC ATT GGC AGC ACC GGC AAC CCT AGC GGC GGC AAC CCT CCC GGC GGA 1450 
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Pro lie Gly Ser Thr Gfy Asn Pro SerGly Gly Asn Pro Pro Gly Gty 
425 430 435 440 



AAC 
Asn 



1453 



(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Oi) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro Leu Thr Trp 
15 10 15 

Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly Ser Val 
20 25 30 

Val He Asp Ala Asn Trp Arg Trp Thr His Ala Thr Asn Ser Ser Thr 
35 40 45 

Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asn 
50 55 60 

Glu Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 
65 70 75 80 

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe Val 
65 90 95 

Thr Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Mel Ala 
100 105 110 

Ser Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser 
115 120 125 

Phe Asp Val Asp Val Ser Gin Leu Pro Cys Gly Leu Asn Gly Ala Leu 
130 135 140 

Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Thr 
145 150 155 160 

Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gin Cys 
165 170 175 

Pro Arg Asp Leu Lys Phe lie Asn Gly Gin Ala Asn Val Glu Gly Trp 
180 185 190 
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Glu Pro Ser Ser Asn Asn Ala Asn Thr Gly He Gly Gly His Gly Ser 
195 200 205 

Cys Cys Ser Glu Met Asp lie Trp Glu Ala Asn Ser lie Ser Glu Ala 
210 215 220 

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu lie Cys Glu Gly 
225 230 235 240 

Asp Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys 
245 250 255 

Asp Pro Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser 
260 265 270 

Phe Tyr Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu 
275 260 285 

Thr Val Val Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr Tyr 
290 295 300 

Val Gin Asn Gly Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly Ser 
305 310 315 320 

Tyr Ser Gly Asn Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala 
325 330 335 

Glu Phe Gly Gly Ser Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin Phe 
340 345 350 

Lys Lys Ala Thr Ser Gly Gly Met Val Leu Val Met Ser Leu Trp Asp 
355 360 365 

Asp Thr Thr Pro Thr Cys Cys Gly Trp Thr Pro Pro Thr ArQ Gin Thr 
370 375 380 

Arg Pro Pro Pro His Pro Val Pro Cys Ala Glu Ala Ala Pro Pro Ala 
385 390 395 400 

Pro Val Ser Leu Leu Arg Ser Asn Leu Ser Leu Pro Thr Pro Arg Ser 
405 410 415 

Pro Ser Pro Thr Ser Ser Ser Asp Pro Leu Ala Ala Pro Ala Thr Leu 
420 425 430 

Ala Ala Ala Thr Leu Pro Ala Glu 
435 440 

(2) INFORMATION FOR SEQ ID NO:1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j0in(1..161. 218..465. 556..1244) 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:11: 
TCG GGA ACC GCT ACG TAT TCA GGC AAC CCT TTT GTT GGG GTC ACT CCT 48 

Ser Gly Thr Ala Thr Tyr Ser Gly Asn Pro Phe Val Gly Val Thr Pro 
1 S 10 15 

TGG GCC AAT GCA TAT TAC GCC TCT GAA GTT AGC AGC CTC GCT ATT CCT 96 
Trp Ala Asn Ala Tyr Tyr Ala Ser Glu Val Ser Ser Leu Ala lie Pro 
20 25 30 

AGC TTG ACT GGA GCC ATG GCC ACT GCT GCA GCA GCT GTC GCA AAG GTT 144 
Ser Leu Thr Gly Ala Met Ala Thr Ala Ala Ala Ala Val Ala Lys Val 
35 40 45 

CCC TCT TTT ATG TGG CT GTAGGTCCTC CCGGAACCAA GGCAATCTGT 1 91 

Pro Ser Phe Met Trp Leu 
SO 

TACTGAAGGC TCATCATTCA CTGCAG A GAT ACT CTT GAC AAG ACC CCT CTC 242 

Asp Thr Leu Asp Lys Thr Pro Leu 
55 60 

ATG GAG CAA ACC TTG GCC GAC ATC CGC ACC GCC AAC AAG AAT GGC GGT 290 
Met Glu Gin Thr Leu Ala Asp He Arg Thr Ala Asn Lys Asn Gly Gly 
65 70 75 

AAC TAT GCC GGA CAG TTT GTG GTG ATA GAC TTG CCG GAT CGC GAT TGC 338 
Asn Tyr Ala Gly Gin Phe Val Val lie Asp Leu Pro Asp Arg Asp Cys 
80 85 90 

GCT GCC CTT GCC TCG AAT GGC GAA TAC TCT ATT GCC GAT GGT GGC GTC 386 
Ala Ala Leu Ala Ser Asn Gly Glu Tyr Ser lie Ala Asp Gly Gly Val 
95 100 105 110 

GCC AAA TAT AAG AAC TAT ATC GAC ACC ATT CGT CAA ATT GTC GTG GAA 434 
Sa LysTyr Lys Asn Tyr lie Asp Thr He Arg Gin He Val Val Glu 
•1 15 120 125 

TAT TCC GAT ATC CGG ACC CTC CTG GTT ATT G GTATGAGTTT AAACACCTGC 485 
Tyr Ser Asp He Arg Thr Leu Leu Val He 
130 135 

CTCCCCCCCC CCTTCCCTTC CTTTCCCGCC GGCATCTTGT CGTTGTGCTA ACTATTGTTC 
545 

CCTCTTCCAG AG CCT GAC TCT CTT GCC AAC CTG GTG ACC AAC CTC GGT 593 
Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly 
140 145 
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ACT CCA AAG TGT GCC AAT GCT CAG TCA GCC TAC CTT GAG TGC ATC AAC 641 
Thr Pro Lys Cys Ala Asn Ala Gin Ser Ala Tyr Leu Glu Cys He Asn 
150 155 160 165 

TAC GCC GTC ACA CAG CTG AAC CTT CCA AAT GTT GCG ATG TAT TTG GAC 689 
Tyr Ala Val Thr Gin Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp 
170 175 180 

GCT GGC CAT GCA GGA TGG CTT GGC TGG CCG GCA AAC CAA GAC CCG GCC 737 
Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gin Asp Pro Ala 
185 190 195 

GCT CAG CTA TTT GCA AAT GTT TAC AAG AAT GCA TCG TCT CCG AGA GCT 785 
Ala Gin Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala 
200 205 210 

CTT CGC GGA TTG GCA ACC AAT GTC GCC AAC TAC AAC GGG TGG AAC ATT 633 
Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn He 
215 220 225 

ACC AGC CCC CCA TCG TAC ACG CAA GGC AAC GCT GTC TAC AAC GAG AAG 881 
Thr Ser Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val Tyr Asn Glu Lys 
230 235 240 245 

CTG TAC ATC CAC GCT ATT GGA CCT CTT CTT GCC AAT CAC GGC TGG TCC 929 
Leu Tyr lie His Ala Me Gly Pro Leu Leu Ala Asn His Gly Trp Ser 
250 255 260 



AAC GCC TTC TTC ATC ACT GAT CAA GGT CGA TCG GGA AAG CAG CCT ACC 977 
Asn Ala Phe Phe He Thr Asp Gin Gly Arg Ser Gly Lys Gin Pro Thr 
265 270 275 

GGA CAG CAA CAG TGG GGA GAC TGG TGC AAT GTG ATC GGC ACC GGA TTT 1025 
Gly Gin Gin Gin Trp Gly Asp Trp Cys Asn Val He Gly Thr Gly Phe 
280 265 290 

GGT ATT CGC CCA TCC GCA AAC ACT GGG GAC TCG TTG CTG GAT TCG TTT 1 073 
Gly He Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe 
295 300 305 

GTC TGG GTC AAG CCA GGC GGC GAG TGT GAC GGC ACC AGC GAC AGC AGT 1121 
Val Trp Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser 
310 315 320 325 

GCG CCA CGA TTT GAC TCC CAC TGT GCG CTC CCA GAT GCC TTG CAA CCG 1 169 
Ala Pro Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro 
330 335 340 

GCG CCT CAA GCT GGT GCT TGG TTC CAA GCC TAC TTT GTG CAG CTT CTC 1217 
Ala Pro Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu 
345 350 355 

ACA AAC GCA AAC CCA TCG TTC CTG TA 1244 
Thr Asn Ala Asn Pro Ser Phe Leu 
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360 365 

(2) INFORMATION FOR SEO ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 amino acids 

(B) TYPE: amino add 
(D) TOPOLOGY: linear 

(«) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Ser Gly Thr Ala Thr Tyr Ser Gly Asn Pro Phe Val Gly Val Thr Pro 
,5 10 15 

Tro Ala Asn Ala Tyr Tyr Ala Ser Glu Val Ser Ser Leu Ala lie Pro 
20 25 30 

Ser Leu Thr Gly Ala Met Ala Thr Ala Ala Ala Ala Val Ala Lys Val 
35 40 45 

Pro Ser Phe Met Trp Leu Asp Thr Leu Asp Lys Thr Pro Leu Met Glu 
50 55 60 

Gin Thr Leu Ala Asp lie Arg Thr Ala Asn Lys Asn Gly Gly Asn Tyr 
65 70 75 80 

Ala Gly Gin Phe Val Val He Asp Leu Pro Asp Arg Asp Cys Ala Ala 
65 90 95 

Leu Ala Ser Asn Gly Glu Tyr Ser lie Ala Asp Gly Gly Val Ala Lys 
100 105 110 

Tyr Lys Asn Tyr lie Asp Thr lie Arg Gin He Val Val Glu Tyr Ser 
115 120 125 

Asp He Arg Thr Leu Leu Val lie Glu Pro Asp Ser Leu Ala Asn Leu 
130 135 140 

Val Thr Asn Leu Gly Thr Pro Lys Cys Ala Asn Ala Gin Ser Ala Tyr 

145 150 155 160 

Leu Glu Cys He Asn Tyr Ala Val Thr Gin Leu Asn Leu Pro Asn Val 
165 170 175 

Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala 
180 185 190 

Asn Gin Asp Pro Ala Ala Gin Leu Phe Ala Asn Val Tyr Lys Asn Ala 
195 200 205 

Ser Ser Pro Arg Ala Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr 
210 215 220 
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Asn Gly Tip Asn He Thr Ser Pro Pro Ser Tyr Thr Gin Gly Asn Ala 
225 230 235 240 

Val Tyr Asn Glu Lys Leu Tyr lie His Ala lie Gly Pro Leu Leu Ala 
245 250 255 

Asn His Gly Trp Ser Asn Ala Phe Phe He Thr Asp Gin Gly Arg Ser 
260 265 270 

Gly Lys Gin Pro Thr Gly Gin Gin Gin Trp Gly Asp Trp Cys Asn Val 
275 260 265 

He Gly Thr Gly Phe Gly He Arg Pro Ser Ala Asn Thr Gly Asp Ser 
290 295 300 

Leu Leu Asp Ser Phe Val Trp Val Lys Pro Gly Gly Glu Cys Asp Gly 
305 310 315 320 

Thr Ser Asp Ser Ser Ala Pro Arg Phe Asp Ser His Cys Ala Leu Pro 
325 330 335 

Asp Ala Leu Gin Pro Ala Pro Gin Ala Gly Ala Trp Phe Gin Ala Tyr 
340 345 350 

Phe Val Gin Leu Leu Thr Asn Ala Asn Pro Ser Phe Leu 
355 360 365 



(2) INFORMATION FOR SEQ ID NO:13: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1..704, 775..1201) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

CAG CAA CCG GGT ACC AGC ACC CCC GAG GTC CAT CCC AAG TTG ACA ACC 46 
Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr Thr 
1 5 10 15 

TAC AAG TGT ACA AAG TCC GGG GGG TGC GTG GCC CAG GAC ACC TCG GTG 96 
Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser Val 
20 25 30 

GTC CTT GAC TGG AAC TAC CGC TGG ATG CAC GAC GCA AAC TAC AAC TCG 144 
Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn Ser 
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35 40 45 

TGC ACC GTC AAC GGC GGC GTC AAC ACC ACG CTC TGC CCT GAC GAG GCG 1 92 
Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu Ala 
50 55 60 

ACC TGT GGC AAG AAC TGC TTC ATC GAG GGC GTC GAC TAC GCC GCC TCG 240 
Thr Cys Gly Lys Asn Cys Phe He Glu Gly Val Asp Tyr Ala Ala Ser 
65 70 75 SO 

GGC GTC ACG ACC TCG GGC AGC AGC CTC ACC ATG AAC CAG TAC ATG CCC 288 
Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met Pro 
85 BO 95 

AGC AGC TCT GGC GGC TAC AGC AGC GTC TCT CCT CGG CTG TAT CTC CTG 336 
Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Arg Leu Tyr Leu Leu 
100 105 110 

GAC TCT GAC GGT GAG TAC GTG ATG CTG AAG CTC AAC GGC CAG GAG CTG 384 
Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu Leu 
115 120 125 

AGC TTC GAC GTC GAC CTC TCT GCT CTG CCG TGT GGA GAG AAC GGC TCG 432 
Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly Ser 
130 135 140 

CTC TAC CTG TCT CAG ATG GAC GAG AAC GGG GGC GCC AAC CAG TAT AAC 480 
Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr Asn 
145 150 155 160 

ACG GCC GGT GCC AAC TAC GGG AGC GGC TAC TGC GAT GCT CAG TGC CCC 528 
Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys Pro 
165 170 175 

GTC CAG ACA TGG AGG AAC GGC ACC CTC AAC ACT AGC CAC CAG GGC TTC 576 
Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly Phe 
180 185 190 



TGC TGC AAC GAG ATG GAT ATC CTG GAG GGC AAC TCG AGG GCG AAT GCC 624 
Cys Cys Asn Glu Met Asp lie Leu Glu Gly Asn Ser Arg Ala Asn Ala 
195 200 205 

TTG ACC CCT CAC TCT TGC ACG GCC ACG GCC TGC GAC TCT GCC GGT TGC 672 

Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly Cys 
210 215 220 

GGC TTC AAC CCC TAT GGC AGC GGC TAC AAA AG GTGAGCCTGA 714 
Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser 
225 230 235 

TGCCACTACT ACCCCTTTCC TGGCGCTCTC GCGGTTTTCC ATGCTGACAT 
GGTTTTCCAG 774 

C TAC TAC GGC CCC GGA GAT ACC GTT GAC ACC TCC AAG ACC TTC ACC 820 
Tyr Tyr Gly Pro Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr 
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240 24S 250 

ATC ATC ACC CAG TTC AAC ACG GAC AAC GGC TCG CCC TCG GGC AAC CTT 868 
lie lie Thr Gin Phe Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu 
255 260 265 

GTG AGC ATC ACC CGC AAG TAC CAG CAA AAC GGC GTC GAC ATC CCC AGC 916 
Val Ser lie Thr Arg Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser 
270 275 280 

GCC CAG CCC GGC GGC GAC ACC ATC TCG TCC TGC CCG TCC GCC TCA GCC 064 
Ala Gin Pro Gly Gly Asp Thr lie Ser Ser Cys Pro Ser Ala Ser Ala 
285 290 295 

TAC GGC GGC CTC GCC ACC ATG GGC AAG GCC CTG AGC AGC GGC ATG GTG 1012 
Tyr Gly Gly Leu Ala Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val 
300 305 310 

CTC GTG TTC AGC ATT TGG AAC GAC AAC AGC CAG TAC ATG AAC TGG CTC 1060 

Leu Val Phe Ser He Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu 

315 320 325 330 

GAC AGC GGC AAC GCC GGC CCC TGC AGC AGC ACC GAG GGC AAC CCA TCC 1 108 
Asp Ser Gly Asn Ala Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser 
335 340 345 

AAC ATC CTG GCC AAC AAC CCC AAC ACG CAC GTC GTC TTC TCC AAC ATC 1 1 56 
Asn He Leu Ala Asn Asn Pro Asn Thr His Val Val Phe Ser Asn He 
350 355 360 

CGC TGG GGA GAC ATT GGG TCT ACT ACG AAC TCG ACT GCG CCC CCG 1201 
Arg Trp Gly Asp lie Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro 
365 370 375 



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Gin Gin Pro Gly Thr Ser Thr Pro Glu Val His Pro Lys Leu Thr Thr 
1 5 10 15 

Tyr Lys Cys Thr Lys Ser Gly Gly Cys Val Ala Gin Asp Thr Ser Val 
20 25 30 

Val Leu Asp Trp Asn Tyr Arg Trp Met His Asp Ala Asn Tyr Asn Ser 
35 40 45 

Cys Thr Val Asn Gly Gly Val Asn Thr Thr Leu Cys Pro Asp Glu Ala 
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50 55 60 

Thr Cys Gly Lys Asn Cys Phe lie Glu Gly Val Asp Tyr Ala Ala Ser 
65 70 75 80 

Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met Asn Gin Tyr Met Pro 
85 90 95 

Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro Ary Leu Tyr Leu Leu 
100 105 110 

Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn Gly Gin Glu Leu 
115 120 125 

Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu Asn Gly Ser 
130 135 140 

Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn Gin Tyr Asn 

145 150 155 160 

Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin Cys Pro 
165 170 175 

Val Gin Thr Trp Arp. Asn Gly Thr Leu Asn Thr Ser His Gin Gly Phe 
160 185 190 

Cys Cys Asn Glu Met Asp lie Leu Glu Gly Asn Ser Arp. Ala Asn Ala 
195 200 205 

Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly Cys 
210 215 220 

Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Ser Tyr Tyr Gly Pro Gly 

230 235 240 



Asp Thr Val Asp Thr Ser Lys Thr Phe Thr He lie Thr Gin Phe Asn 
245 250 255 

Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val Ser lie Thr Arg Lys 
260 265 270 

Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin Pro Gly Gly Asp 
275 280 285 

Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly Leu Ala Thr 
290 295 300 

Met Gly Lys Ala Leu Ser Ser Gly Mel Val Leu Val Phe Ser He Trp 
305 310 315 320 

Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn Ala Gly 
325 330 335 

Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn He Leu Ala Asn Asn 
340 345 350 
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Pro Asn Thr His Val Val Phe Ser Asn lie ArQ Trp Gly Asp tie Gly 
35S 360 365 

Ser Thr Thr Asn Ser Thr Ala Pro Pro 
370 375 

(2) INFORMATION FOR SEQ ID NO:15: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(1..56, 231.. 11 58) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

GGG GTC CGA TTT GCC GGC GTT AAC ATC GCG GGT TTT GAC TTT GGC TGT 48 
Gly Val Arg Phe Ala Gly Val Asn He Ala Gly Phe Asp Phe Gly Cys 
1 5 10 15 

ACC ACA GA GTGAGTACCC TTGTTTCCTG GTGTTGCTGG CTGGTTGGGC 96 
Thr Thr Asp 



GGGTATACAG CGAAGCGGAC GCAAGAACAC CGCCGGTCCG CCACCATCAA 
GATGTGGGTG 156 

GTAAGCGGCG GTGTTTTGTA CAACTACCTG ACAGCTCACT CAGGAAATGA 
GAATTAATGG 216 

AAGTCTTGTT ACAG T GGC ACT TGC GTT ACC TCG AAG GTT TAT CCT CCG 264 
Gly Thr Cys Val Thr Ser Lys Val Tyr Pro Pro 
20 25 30 

TTG AAG AAC TTC ACC GGC TCA AAC AAC TAC CCC GAT GGC ATC GGC CAG 312 
Leu Lys Asn Phe Thr Gly Ser Asn Asn Tyr Pro Asp Gly lie Gly Gin 
35 40 45 

ATG CAG CAC TTC GTC AAC GAG GAC GGG ATG ACT ATT TTC CGC TTA CCT 360 
Met Gin His Phe Val Asn Glu Asp Gly Met Thr lie Phe Arg Leu Pro 
50 55 60 



GTC GGA TGG CAG TAC CTC GTC AAC AAC AAT TTG GGC GGC AAT CTT GAT 408 
Val Gly Trp Gin Tyr Leu Val Asn Asn Asn Leu Gly Gly Asn Leu Asp 
65 70 75 
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600 



TCC ACG AGC ATT TCC AAG TAT GAT CAG CTT GTT CAG GGG TGC CTG TCT 456 
Ser Thr Ser lie Ser Lys Tyr Asp Gin Leu Val Gin Gly Cys Leu Ser 
60 65 90 

CTG GGC GCA TAC TGC ATC GTC GAC ATC CAC AAT TAT GCT CGA TGG AAC 504 
Leu Gly Ala Tyr Cys He Val Asp He His Asn Tyr Ala Arg Trp Asn 
95 100 105 110 

GGT GGG ATC ATT GGT CAG GGC GGC CCT ACT AAT GCT CAA TTC ACG AGC 552 
Gly Gly lie He Gly Gin Gly Gly Pro Thr Asn Ala Gin Phe Thr Ser 
115 120 125 

CTT TGG TCG CAG TTG GCA TCA AAG TAC GCA TCT CAG TCG AGG GTG TGG 
Leu Trp Ser Gin Leu Ala Ser Lys Tyr Ala Ser Gin Ser Arg Val Trp 
130 135 140 

TTC GGC ATC ATG AAT GAG CCC CAC GAC GTG AAC ATC AAC ACC TGG GCT 646 
Phe Gly He Met Asn Glu Pro His Asp Val Asn lie Asn Thr Trp Ala 
145 150 155 

GCC ACG GTC CAA GAG GTT GTA ACC GCA ATC CGC AAC GCT GGT GCT ACG 696 
Ala Thr Val Gin Glu Val Val Thr Ala He Arg Asn Ala Gly Ala Thr 
160 165 170 

TCG CAA TTC ATC TCT TTG CCT GGA AAT GAT TGG CAA TCT GCT GGG GCT 744 
Ser Gin Phe lie Ser Leu Pro Gly Asn Asp Trp Gin Ser Ala Gly Ala 
175 180 185 190 

TTC ATA TCC GAT GGC AGT GCA GCC GCC CTG TCT CAA GTC ACG AAC CCG 792 
Phe He Ser Asp Gly Ser Ala Ala Ala Leu Ser Gin Val Thr Asn Pro 
195 200 205 

GAT GGG TCA ACA ACG AAT CTG ATT TTT GAC GTG CAC AAA TAC TTG GAC 840 
Asp Gly Ser Thr Thr Asn Leu lie Phe Asp Val His Lys Tyr Leu Asp 
210 215 220 

TCA GAC AAC TCC GGT ACT CAC GCC GAA TGT ACT ACA AAT AAC ATT GAC 888 
Ser Asp Asn Ser Gly Thr His Ala Glu Cys Thr Thr Asn Asn He Asp 
225 230 235 

GGC GCC TTT TCT CCG CTT GCC ACT TGG CTC CGA CAG AAC AAT CGC CAG 936 
Gly Ala Phe Ser Pro Leu Ala Thr Trp Leu Arg Gin Asn Asn Arg Gin 
240 245 250 

GCT ATC CTG ACA GAA ACC GGT GGT GGC AAC GTT CAG TCC TGC ATA CAA 984 
A?N 5. tSoEtS Gly Gly Gly Asn Val Gin Ser Cys He Gin 
255 260 265 270 

GAC ATG TGC CAG CAA ATC CAA TAT CTC AAC CAG AAC TCA GAT GTC TAT 1032 
Asp Met Cys Gin Gin He Gin Tyr Leu Asn Gin Asn Ser Asp Val Tyr 
275 260 285 

CTT GGC TAT GTT GGT TGG GGT GCC GGA TCA TTT ' ftAT AGO ACG TAT GTC 1080 
Leu Gly Tyr Val Gly Trp Gly Ala Gly Ser Phe Asp Ser Thr Tyr Val 
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290 295 300 

CTG ACG GAA ACA CCG ACT AGC AGT GGT AAC TCA TGG ACG GAC ACA TCC 1 128 
Leu Thr Glu Thr Pro Thr Ser Ser Gly Asn Ser Trp Thr Asp Thr Ser 
305 310 315 

TTG GTC AGC TCG TGT CTC GCA AGA AAG TA 1 1 58 

Leu Val Ser Ser Cys Leu Ala Arg Lys 
320 325 



(2) INFORMATION FOR SEQ ID NO:18: 

(0 SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 327 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Gly Val Arg Phe Ala Gly Val Asn lie Ala Gly Phe Asp Phe Gly Cys 
15 10 15 

Thr Thr Asp Gly Thr Cys Val Thr Ser Lys Val Tyr Pro Pro Leu Lys 
20 25 30 

Asn Phe Thr Gly Ser Asn Asn Tyr Pro Asp Gly He Gly Gin Met Gin 
35 40 45 

His Phe Val Asn Glu Asp Gly Met Thr He Phe Arg Leu Pro Val Gly 
50 55 80 

Trp Gin Tyr Leu Val Asn Asn Asn Leu Gly Gly Asn Leu Asp Ser Thr 
65 70 75 80 

Ser He Ser Lys Tyr Asp Gin Leu Val Gin Gly Cys Leu Ser Leu Gly 
85 90 95 

Ala Tyr Cys He Val Asp He His Asn Tyr Ala Arg Trp Asn Gly Gly 
100 105 110 

He He Gly Gin Gly Gly Pro Thr Asn Ala Gin Phe Thr Ser Leu Trp 
115 120 125 

Ser Gin Leu Ala Ser Lys Tyr Ala Ser Gin Ser Arg Val Trp Phe Gly 
130 135 140 

lie Met Asn Glu Pro His Asp Val Asn He Asn Thr Trp Ala Ala Thr 
145 150 155 160 

Val Gin Glu Val Val Thr Ala He Arg Asn Ala Gly Ala Thr Ser Gin 
165 170 175 

Phe lie Ser Leu Pro Gly Asn Asp Trp Gin Ser Ala Gly Ala Phe He 
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180 185 190 

Ser Asp Gly Ser Ala Ala Ala Leu Ser Gin Val Thr Asn Pro Asp Gly 
195 200 205 

Ser Thr Thr Asn Leu He Phe Asp Val His Lys Tyr Leu Asp Ser Asp 
210 215 220 

Asn Ser Gly Thr His Ala Glu Cys Thr Thr Asn Asn lie Asp Gly Ala 
225 230 235 240 

Phe Ser Pro Leu Ala Thr Trp Leu Arg Gin Asn Asn Arg Gin Ala lie 
245 250 255 

Leu Thr Glu Thr Gly Gly Gly Asn Val Gin Ser Cys lie Gin Asp Met 
260 265 270 

Cys Gin Gin lie Gin Tyr Leu Asn Gin Asn Ser Asp Val Tyr Leu Gly 
275 280 285 

Tyr Val Gly Trp Gly Ala Gly Ser Phe Asp Ser Thr Tyr Val Leu Thr 
290 295 300 

Glu Thr Pro Thr Ser Ser Gly Asn Ser Trp Thr Asp Thr Ser Leu Val 
305 310 315 320 

Ser Ser Cys Leu Ala Arg Lys 
325 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..72 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

CGT GGC ACC ACC ACC ACC CGC CGC CCA GCC ACT ACC ACT GGA AGC TCT 
Arg Gly Thr Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser 
15 10 15 

CCC GGA CCT ACC CAG TCT CAC TAC 72 
Pro Gly Pro Thr Gin Ser His Tyr 
20 
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(2) INFORMATION FOR SEQ ID NO:1B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ID MOLECULE TYPE: protein 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Arg Gly Thr Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser 
1 5 10 15 

Pro Gly Pro Thr Gin Ser His Tyr 
20 

(2) INFORMATION FOR SEQ ID NO:19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..129 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

GGC GCT GCA AGC TCA AGC TCG TCC ACG CGC GCC GCG TCG ACG ACT TCT 48 
Gly Ala Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser 
1 5 10 15 

CGA GTA TCC CCC ACA ACA TCC CGG TCG AGC TCC GCG ACG CCT CCA CCT 96 
Arg Val Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro 
20 25 30 

GGT TCT ACT ACT ACC AG A GTA CCT CCA GTC GGA 129 
Gly Ser Thr Thr Thr Arg Val Pro Pro Val Gly 
35 40 



(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



WO 96/23928 PCT/US96/00977 



(iO MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Gly Ala Ala Ser Scr Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser 
1 5 10 15 

Arg Val Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro 
20 25 30 

Gly Ser Thr Thr Thr Arg Val Pro Pro Val Gly 
35 40 

(2) INFORMATION FOR SEQ ID NO:21: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

CO MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..81 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

CCC CCG CCT GCG TCC AGC ACG ACG TTT TCG ACT ACA CCG AGG AGC TCG 48 
Pro Pro Pro Ala Ser Ser Thr Thr Phe Ser Thr Thr Pro Arg Ser Ser 
1 5 10 15 



ACG ACT TCG AGC AGC CCG AGC TGC ACG CAG ACT 81 
Thr Thr Ser Ser Ser Pro Ser Cys Thr Gin Thr 
20 25 



(2) INFORMATION FOR SEQ ID NO:22: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Pro Pro Pro Ala Ser Ser Thr Thr Phe Ser Thr Thr Pro Arg Ser Ser 
1 5 10 15 

Thr Thr Ser Ser Ser Pro Ser Cys Thr Gin Thr 
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20 25 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA (genomic) 



fix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..102 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

CCG GGA GCC ACT ACT ATC ACC ACT TCG ACC CGG CCA CCA TCC GGT CCA 48 
Pro Gly Ala Thr Thr He Thr Thr Ser Thr Arg Pro Pro Ser Gly Pro 
1 5 10 15 

ACC ACC ACC ACC AGG GCT ACC TCA ACA AGC TCA TCA ACT CCA CCC ACG 96 
Thr Thr Thr Thr Arg Ala Thr Ser Thr Ser Ser Ser Thr Pro Pro Thr 
20 25 30 

AGCTCT 102 
Ser Ser 



(2) INFORMATION FOR SEQ ID NO:24: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 

(xi*) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

Pro Gly Ala Thr Thr lie Thr Thr Ser Thr Arg Pro Pro Ser Gly Pro 
1 5 10 15 

Thr Thr Thr Thr Arg Ala Thr Ser Thr Ser Ser Ser Thr Pro Pro Thr 
20 25 30 

Ser Ser 



(2) INFORMATION FOR SEQ ID NO:25: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iO MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..51 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ATG TAT CGG AAG TTG GCC GTC ATC TCG GCC TTC TTG GCC ACA GCT CGT 48 
Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
1 5 10 15 

GCT 51 
Ala 



(2) INFORMATION FOR SEO ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Met Tyr Arg Lys Leu Ala Val lie Ser Ala Phe Leu Ala Thr Ala Arg 
1 5 10 15 

Ala 



(2) INFORMATION FOR SEQ ID NO:27: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..72 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N027: 

ATG ATT GTC G6C ATT CTC ACC ACG CTG GCT ACG CTG GCC ACA CTC GCA 48 
Met lie Val Gly lie Leu Thr Thr Leu Ala Thr Leu Ala Thr Leu Ala 
1 5 10 15 

GCT AGT GTG CCT CTA GAG GAG CGG 72 
Ala Ser Val Pro Leu Glu Glu Aig 
20 



(2) INFORMATION FOR SEQ ID NO:2B: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met He Val Gly lie Leu Thr Thr Leu Ala Thr Leu Ala Thr Leu Ala 
15 10 15 

Ala Ser Val Pro Leu Glu Glu Arg 
20 

(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA (genomic) 



Ox) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

ATG GCG CCC TCA GTT ACA CTG CCG TTG ACC ACG GCC ATC CTG GCC ATT 48 
Met Ala Pro Ser Val Thr Leu Pro Leu Thr Thr Ala lie Leu Ala He 
15 10 15 

* 

GCC CGG CTC GTC GCC GCC 66 
Ala Arg Leu Val Ala Ala 
20 
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(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Met Ala Pro Ser Val Thr Leu Pro Leu Thr Thr Ala He Leu Ala lie 
1 5 10 15 

Ala Arg Leu Val Ala Ala 
20 

(2) INFORMATION FOR SEQ ID NO;31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
P) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA (genomic) 



Ox) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..63 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

ATG AAC AAG TCC GTG GCT CCA TTG CTG CTT GCA GCG TCC ATA CTA TAT 48 
Met Asn Lys Ser Val Ala Pro Leu Leu Leu Ala Ala Ser lie Leu Tyr 
1 5 10 15 

GGC GGC GCC GTC GCA 63 
Gly Gly Ala Val Ala 
20 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
Met Asn Lys Ser Val Ala Pro Leu Leu Leu Ala Ala Ser lie Leu Tyr 
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15 10 15 

Gly Gly Ala Val Ala 
20 



(2) INFORMATION FOR SEQ ID NO:33: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

AAACCAGCTG TGACCAGTGG GCAACCTTCA CTGGCAACGG CTACACAGTC 
AGCAACAACC 60 

TTTGGGGAGC ATCAGCCGGC TCTGGATTTG GCTGCGTGAC GGCGGTATCG 
CTCAGCGGCG 120 

GGGCCTCCTG GCACGCAGAC TGGCAGTGGT CCGGCGGCCA GAACAACGTC 
AAGTCGTACC 180 

AGAACTCTCA GATTGCCATT CCCCAGAAGA GGACCGTCAA CAGCATCAGC 
AGCATGCCCA 240 

CCACTGCCAG CTGGAGCTAC AGCGGGAGCA ACATCCGCGC TAATGTTGCG 
TATGACTTGT 300 

TCACCGCAGC CAACCCGAAT CATGTCACGT ACTCGGGAGA CTACGAACTC 
ATGATCTGGT 360 

AAGCCATAAG AAGTGACCCT CCTTGATAGT TTCGACTAAC AACATGTCTT 
GAGGCTTGGC 420 

AAATACGGCG ATATTGGGCC GATTGGGTCC TCACAGGGAA CAGTCAACGT 
CGGTGGCCAG 480 

AGCTGGACGC TCTACTATGG CTACAACGGA GCCATGCAAG TCTATTCCTT 
TGTGGCCCAG 540 

ACCAACACTA CCAACTACAG CGGAGATGTC AAGAACTTCT TCAATTATCT CCGAGACAAT 
600 

AAAGGATACA ACGCTGCAGG CCAATATGTT CTTAGTAAGT CACCCTCACT 
GTGACTGGGC 660 

TGAGTTTGTT GCAACGTTTG CTAACAAAAC CTTCGTATAG GCTACCAATT TGGTACCGAG 
720 ' 
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CCCTTCACGG GCAGTGGAAC TCTGAACGTC GCATCCTGGA CCGCATCTAT CAACTAA 
777 



2) INFORMATION FOR SEQ ID NO:34: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANOEDNESS: Single 

(D) TOPOLOGY: linear 

(if) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Gin Thr Ser Cys Asp Gin Tip Ala Thr Phe Thr Gly Asn Gly Tyr Thr 
15 10 15 

Val Ser Asn Asn Leu Trp Gly Ala Ser Ala Gly Ser Gly Phe Gly Cys 
20 25 30 

Val Thr Ala Val Ser Leu Ser Gly Gly Ala Ser Trp His Ala Asp Trp 
35 40 45 

Gin Trp Ser Gly Gly Gin Asn Asn Val Lys Ser Tyr Gin Asn Ser Gin 
50 55 60 

He Ala lie Pro Gin Lys Arg Thr Val Asn Ser lie Ser Ser Met Pro 
65 70 75 80 

Thr Thr Ala Ser Trp Ser Tyr Ser Gly Ser Asn lie Arg Ala Asn Val 
85 90 95 

Ala Tyr Asp Leu Phe Thr Ala Ala Asn Pro Asn His Val Thr Tyr Ser 
100 105 110 

Gly Asp Tyr Glu Leu Met lie Trp Leu Gly Lys Tyr Gly Asp lie Gly 
115 120 125 

Pro He Gly Ser Ser Gin Gly Thr Val Asn Val Gly Gly Gin Ser Trp 
130 135 140 

Thr Leu Tyr Tyr Gly Tyr Asn Gly Ala Met Gin Val Tyr Ser Phe Val 
145 150 155 160 

Ala Gin Thr Asn Thr Thr Asn Tyr Ser Gly Asp Val Lys Asn Phe Phe 
165 170 175 

Asn Tyr Leu Arg Asp Asn Lys Gly Tyr Asn Ala Ala Gly Gin Tyr Val 
180 185 190 

Leu Ser Tyr Gin Phe Gly Thr Glu Pro Phe Thr Gly Ser Gly Thr Leu 
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195 200 205 

Asn Val Ala Ser Trp Thr Ala Ser lie Asn 
210 215 



(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
ATGAAGTTCC TTCAAGTCCT CCCTGCCCTC ATACCGGCCG CCCTGGCCC 49 



(2) INFORMATION FOR SEQ ID NO:36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Lys Phe Leu Gin Val Leu Pro Ala Leu lie Pro Ala Ala Leu Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO:37: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0.37: 

AGCTCGTAGA GCGTTGACTT GCCTGTGGTC TGTCCAGACG GGGGACGATA GAATGCG 
57 



(2) INFORMATION FOR SEQ ID NO:38: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GTCACCTTCT CCAACATCAA GTTCGGACCC ATTGGCAGCA CCGGCTAA 
(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GGGGTTTAAA CCCGCGGGGA TT 
(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
TGAGCCGAGG CCTCC 
(2) INFORMATION FOR SEQ ID NO:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
AGCTTGAG AT CTGAAGCT 1 8 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
CTAGAGGAGC GGTCGGGAAC CGCTAC 26 
(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Leu Glu Glu Arg Ser Gly Thr Ala Thr 
1 5 
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WE claim : 

1. A method of treating cellulose containing fabrics 
with cellulase comprising the steps of: 

(a) contacting said cellulose containing fabric 
with a treating composition comprising an effective 
amount of truncated cellulase, truncated cellulase 
derivative or a naturally occurring cellulase which lacks 
a cellulose binding domain; and 

(b) incubating said cellulose containing fabric in 
contact with said truncated cellulase or truncated 
cellulase derivative under conditions effective to treat 
said fabric. 

2. The method according to claim 1, wherein said 
truncated cellulase enzyme comprises truncated cellulase core 
or a derivative thereof. 

3. The method according to claim 2, wherein said 
truncated cellulase core comprises truncated cellulase 
selected from the group consisting of EG I type core, EGII type 
core, CBHI type core, CBHII type core, EGV type core and 
derivatives thereof. 

4. The method according to claim 1, wherein said 
treating composition further comprises EGIII. 

5. The method according to claim 1, wherein said 
cellulose containing fabric comprises a cotton containing 

fabric. 

6. The method according to claim 5, wherein said cotton 
containing fabric comprises dyed denim. 

7. The method according to claim 1, wherein said 
treating composition comprises truncated cellulase or 
truncated cellulase derivative in a concentration of about 0.1 
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to about 1,000 ppm total protein. 

8. The method according to claim 1, wherein said 
treating composition comprises truncated cellulase or 
truncated cellulase derivative in a concentration of about 0.2 
to about 500 ppm. 

9. The method according to claim 1, wherein said 
truncated cellulase or derivative thereof is derived from a 
microorganism which is a fungus or bacteria. 

10. The method according to claim 9, wherein said fungus 
is Trichoderma sd. 

11. The method according to claim 10, wherein said 
TritflPtenna Sffi is Trichoderma lonaibrachiatum. 

« 

12. The method according to claim 1, wherein said 
treating method comprises stonewashing the cellulose 
containing fabric and said treating composition comprises a 
stonewashing composition. 

13. The method according to claim 12, wherein said 
cellulose containing fabric is colored. 

14. The method according to claim 13, wherein said 
colored fabric is dyed denim. 

15. The method according to claim 12, comprising the 
additional step of treating said fabric with pumice 
simultaneously with, before or after said treatment step. 

16. The method according to claim 12, wherein said 
stonewashing composition comprises truncated cellulase core or 
derivatives thereof. 

17. The method according to claim 16, wherein said 
truncated cellulase core is, selected from among the group 



WO 96/23928 



PCIYUS96/00977 



consisting of EGI type core, EGTI type core, EGV type core, 
CBHI type core, CBHII type core and derivatives thereof. 

18. The method according to claim 16, wherein said 
truncated cellulase core is present in a concentration of 
about 10 to about 400 ppro total protein. 

19. The method according to claim 16, wherein said 
truncated cellulase core is present in a concentration of 
about 20 to about 100 ppm total protein. 

20. The method according to claim 1, wherein said 
treating method comprises washing the cellulose containing 
fabric and said treating composition is a detergent 
composition comprising a surfactant. 

21. The method according to claim 20, wherein said 
surfactant comprises nonionic ethoxylated alkyl phenols or 
nonionic ethokylated alcohols. 

22. The method according to claim 20, wherein said 
treating composition comprises truncated cellulase core or 
derivatives thereof. 

23. The method according to claim 22, wherein said 
truncated cellulase core is selected from the group consisting 
of EGI type core, EGII type core, EGV type core, CBHI type 
core, CBHII type core and derivatives thereof. 

24. The method according to claim 22, wherein said 
truncated cellulase core or derivatives thereof is present in 
a concentration of about 0.1 to about 1000 ppm. 

25. The method according to claim 22, wherein said 
truncated cellulase core or derivatives thereof is present in 
a concentration of about 0.2 to about 500 ppm. 

26. A composition for treating a cellulose containing 
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fabric comprising a truncated cellulase or derivative thereof. 

27. The composition according to claim 26 , wherein said 
truncated cellulase or derivative thereof comprises truncated 
cellulase core. 

5 28. The composition of claim 27, wherein said truncated 

cellulase core is selected from the group consisting of CBHI 
type core, CBH11 type core, EG I type core, EGII type core, EGV 
type core and derivatives thereof. 

29. The composition according to claim 26, wherein said 
10 composition is useful for stonewashing cellulose containing 

fabric and further contains a surfactant. 

30. The composition according to claim 29, wherein said 
truncated cellulase core or derivatives thereof is present in 
a concentration of about 1 to about 1000 ppm total protein. 



15 31. The composition according to claim 26, wherein said 

composition is useful as a detergent and further comprises a 
surfactant. 

32. The composition according to claim 31, wherein said 
surfactant comprises nonionic ethoxylated alkyl phenols or 

20 nonionic ethxylated alcohols. 

33. The composition according to claim 31, wherein said 
truncated cellulase core is present in a concentration of 
about 0.1 to about 1000 ppm total protein. 

34. The composition according to claim 26, wherein said 
25 treating composition is formulated as a pre-soak. 

35. A fabric produced by the process of claim 1. 

36. A fabric produced by the process of claim 12. 
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37. A fabric produced by the process of claim 20 

38. The fabric of claim 35, wherein said fabric 
comprises dyed denim. 
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AAGCTTAGCCAAGAACAATAGCCGATAAAGATAGCCTCATTAAACGGAAT 
GAGCTAGTAGGCAAAGTCAGCGAATGTGTATATATAAAGGTTCGAGGTCC 

' 1 . 1 . 1 , L 10Q 

GTGCCTCCCTCATGCTCTCCCCATCTACTCATCAACTCAGATCCTCCAGG 
■ 1,11 1 1 ' — ■ 150 

AGACTTGTACACCATNTTTTGAGGCACAGAAACCCAATAGTCAACCGCGG 
1 ' ■ 1 1 1 1 ' — — l 200 

ACTGGCATCATGTATCGGAAGTTGGCCGTCATCTCGGCCTTCTTGGCCAC 
. 1 1 1 ' 250 

Met Tyr Arg Lys Leu Ala Val lie Ser Alo Phe Leu Ala Thr 

AGCTCGTGCTCAGTCGGCCTGCACTCTCCAATCGGAGACTCACCCGCCTC 

1 1 ' 1 ' 1 ' ' 300 

Ala Arg Alo Gin Ser Ala Cys Thr Leu Gin Ser Glu Thr His Pro Pro 

TGACATGGCAGAAATGCTCGTrrGGTGGCACTTGCACTCAACAGACAGGC 
1 ' 1 ' — — ' 1 1 1 l 

Leu Thr Trp Gin Lys Cys Ser Ser Gly Gly Thr Cys Thr Gin Gin Thr Gly 
TCCGTGGTCATCGACGCCAACTGGCGCTGGACTCACGCTACGAACAGCAG 

1 1 1 ' 1 ' 1 1 — ■ noo 

Ser Vol Val He Asp Ala Asn Trp Arg Trp Thr His Alo Thr Asn Ser Ser 

CACGAACTGCTACGATGGCAACACTTGGAGCTCGACCCTATGTCCTGACA 

— ■ 1 . 1 1 1 1 ' ' 450 

Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp 
ACGAGACCTGCGCGAAGAACTGCTGTCTGGACGGTGCCGCCTACGCGTCC 

. i 1 1 , 1 , ■ , L gQQ 

Asn Glu Thr Cys Alo Lys Asn Cys Cys Leu Asp Gly Ala Ala Tyr Ala Ser 

ACGTACGGAGTTACCACGAGCGGTAACAGCCTCTCCATTGGCTTTGTCAC 

' ' ' 1 ' ' ' 1 1 550 

Thr Tyr Gly Val Thr Thr Ser Gly Asn Ser Leu Ser lie Gly Phe Val Thr 

CCAGTCTGCGCAGAAGAACGTTGGCGCTCGCCTTTACCTTATGGCGAGCG 
' 1 ' 1 1 1 —j ■ i goo 

Gin Ser Ala Gin Lys Asn Val Gly Ala Arg Leu Tyr Leu Met Ala Ser 

ACACGACCTACCAGGAATTCAcCCTGCTTGGCAACGAGTTCTCTTTCGAT 

— ■ « ■ ' ■ 1 ■ < — i l 650 

Asp Thr Thr Tyr Gin Glu Phe Thr Leu Leu Gly Asn Glu Phe Ser Phe Asp 

GTTGATGTTTCGCAGCTGCCGTAAGTGACTTACCATGAACCCCTGACGTA 

i i i i «■■■■« ' ' 700 

Val Asp Val Ser Gin Leu Pro 

TCTTCTTGTGGGCTCCCAGCTGACTGGCCAATTTAAGGTGCGGCTTGAAC 
, ■ , 1 , . ' ■ 1- 750 



Cys Gly Leu Asn 



GGAGCTCTCTACTTCGTGTCCATGGACGCGGATGGTGGCGTGAGCAAGTA 
, 1 , ' 1 800 

Gly Alo Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Val Ser Lys Tyr 
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TCCCACCAACACCGCTGGCGCCAAGTACGGCACGGGGTACTGTGACA6CC 

Pro Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser 

AGTGTCCCCGCGATCTGAAGTTCATCAATGGCCAGGCCAACGTTGAGGGC 

....... i ...... i . » . . i . . i 

Gin Cys Pro Arg Asp Leu Lys Phe He Asn Gly Gin Ala Asn Val Glu Gly 

TGGGAGCCGTCATCCAACAACGCAAACACGGGCATTGGAGGACACGGAAG 

i • * i * • 1 

Trp Glu Pro Ser Ser Asn Asn Alo Asn Thr Gly He Gly Gly His Gly Ser 
CTGCTGCTCTGAGATGGATATCTGGGAGGCCAACTCCATCTCCGAGGCTC 



850 



900 



950 



1000 



1050 



1 100 



1150 



1200 



Cys Cys Ser Glu Met Asp He Trp Glu Ala Asn Ser He Ser Glu Ala 
TTACCCCCCACCCTTGCACGACTGTCGGCCAGGAGATCTGCGAGGGTGAT 

Leu Thr Pro His Pro Cys Thr Thr Val Gly Gin Glu He Cys Glu Gly Asp 
GGGTGCGGCGGAACTTACTCCGATAACAGATATGGCGGCACTTGCGATCC 

. . i ■ - ■ - » I .... i . . l .«■■■■! ■■ i .... 1 

Gly Cys Gly Gly Thr Tyr Ser Asp Asn Arg Tyr Gly Gly Thr Cys Asp Pro 
CGATGGCTGCGACTGGAACCCATACCGCCTGGGCAACACCAGCTTCTACG 

Asp Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser Phe Tyr 
GCCCTGGCTCAAGCTTTACCCTCGATACCACCAAGAAATTGACCGTTGTC 

I - - i - . - . . i I .... i .... i 

Gly Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu Thr Val Val 

ACCCAGTTCGAGACGTCGGGTGCCATCAACCGATACTATGTCCAGAATGG 
• » 1 L 1250 

Thr Gin Phe Glu Thr Ser Gly Ala lie Asn Arg Tyr Tyr Val Gin Asn Gly 

CGTCACTTTCCAGCAGCCCAACGCCGAGCTTGGTAGTTACTCTGGCAACG 
. i . . — i ■ « « ' > L 1300 

Val Thr Phe Gin Gin Pro Asn Ala Glu Leu Gly Ser Tyr Ser Gly Asn 

AGCTCAACGATGATTACTGCACAGCTGAGGAGGCAGAATTCGGCGGATCC 
■ ■ .... i i ' ' 1 1350 

Glu Leu Asn Asp Asp Tyr Cys Thr Ala Glu Glu Ala Glu Phe Gly Gly Ser 

TCTTTCTCAGACAAGGGCGGCCTGACTCAGTTCAAGAAGGCTACCTCTGG 

i i i ■ 1 ' ■ — ' ' 1 1 L 1400 

Ser Phe Ser Asp Lys Gly Gly Leu Thr Gin Phe Lys Lys Ala Thr Ser Gly 
CGGCATGGTTCTGGTCATGAGTCTGTGGGATGATGTGAGTTTGATGGACA 

■ . 1 ■ ■ ■ • L 1450 



Gly Met Val Leu Val Met Ser Leu Trp Asp Asp 
AACATGCGCGTTGACAAAGAGTCAAGCAGCTGACTGAGATGTTACAGTAC 



1500 



— Tyr 

TACGCCAACATGCTGTGGCTGUACTCCACCTACCCGACAAACGAGACCTC 

Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn Glu Thr Ser 
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1600 



1650 



1700 



1750 



1800 
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CTCCACACCCGGTGCCGTGCGCGGAAGCTGCTCCACCAGCTCCGGTGTCC 

I .... i ■ ... I t . i . , 

Ser Thr Pro Gly Ala Vol Arg Gly Ser Cys Ser Thr Ser Ser Gly Vol 
CTGCTCAGGTCGAATCTCAGTCTCCCAACGCCAAGGTCACCTTCTCCAAC 

Pro Alo Gin Va! Glu Ser Gin Ser Pro Asn Ala Lys Val Thr Phe Ser Asn 
ATCAAGTTCGGACCCATTGGCAGCACCGGCAACCCTAGCGGCGGCAACCC 

• 1 i i . ... i ,.i . . . ! 

lie Lys Phe Gly Pro He Gly Ser Thr Gly Asn Pro Ser Gly Gly Asn Pro 

TCCCGGCGGAAACCGTGGCACCACCACCACCCGCCGCCCAGCCACTACCA 

■ ■ ■ 1 ■ ■ ...... i ..... . . » . « 

Pro Gly Gly Asn Arg Gly Thr Thr Thr Thr Arg Arg Pro Ala Thr Thr 

CTGGAAGCTCTCCCGGACCTACCCAGTCTCACTACGGCCAGTGCGGCGGT 

» « 1 ...... i . . . i ....... i 

Thr Gly Ser Ser Pro Gly Pro Thr Gin Ser His Tyr Gly Gin Cys Gly Gly 
ATTGGCTACAGCGGCCCCACGGTCTGCGCCAGCGGCACAACTTGCCAGGT 

lie Gly Tyr Ser Gly Pro Thr Val Cys Ala Ser Gly Thr Thr Cys Gin Val 

CCTGAACCCTTACTACTCTCAGTGCCTGTAAAGCTCCGTGCGAAAGCCTG 
. ' 1 ' ' L - ■ ' ■ l j 900 

Leu Asn Pro Tyr Tyr Ser Gin Cys Leu • 

ACGCACCGGTAGATTCTTGGTGAGCCCGTATCATGACGGCGGCGGGAGCT 
. ' « 1 ' « • ' ■ ■ i 1950 

ACATGGCCCCGGGTGATTTATTTTTTTTGTATCTACTTCTGACCCTTTTC 

— . 1 * . 1 , — — ^ i - 2000 

AAATATACGGTCAACTCATCTTTCACTGGAGATGCGGCCTGCTTGGTATT 

- — ■ — ' • ' « 1 ■ ■ ■ 1 ■ 2050 

GCGATGTTGTCAGCTTGGCAAATTGTGGCTTTCGAAAACACAAAACGATT 

— — ' ■ ■ — — > ■ 1 ■ ■ ■ l 2100 

CCTTAGTAGCCATGCATTTTAAGATAACGGAATAGAAGAAAGAGGAAATT 
. 1 ^ 1 . 1 . 1 , l 2150 

AAAAAAAAAAAAAAAACAAACATCCCGTTCATAACCCGTAGAATCGCCGC 
i i ... i i .. i i i i i i 2200 



1850 



TCTTCGTGTATCCCAGTACCA 
, , , 2221 



FIG.-1C 
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GAATTCTAGGCTAGGTATGCGAGGCACGCGGATCTAGGGCAGACTGGGCA 

TTGCATAGCTATGGTGTAGTAGAACTCCCGTCAACGGCTATTCTCACCTA 

- 1— — i ■ ■ i ' 100 

GACTTTCCCCTTCGAACTGACAAGTTGTTATATTGCCTGTGTACCAAGCG 

• 1 i ■ ■ ■ ■ ■ ' ■ * » — ■ ■ ■ 1 1 50 

CTAATGTGGACAGGATTAATGCCAGAGTTCATTAGCCTCAAGTAGAGCCT 

■ ■ - i ■ ■ ■ i ■ ■ t * 1 ii i ' 200 

ATTTCCTCGCCGGAAAGTCATCTCTCTTATTGCATTTCTGCCTTCCACTA 
i i i i i i i i i ■ » ' — L 250 

ACTCAGGGTGCAGCGCAACACTACACGCAACATATCACATTTATTAGCCG 

■ ... i 1 u. ■ ■ 1 ■ ■ 1 300 

TGCAACAAGGCTATTCTACGAAAAATGCTACACTCCACATGTTAAAGGCG 

i ' 1 350 

CATTCAACCAGCTTCTTTATTGGGTAATATACAGCCAGGCGGGGATGAAG 

■ i i ■ ■ i ■ ■ i i i ■ i i , , ■ , i — . — i xjoO 

CTCATTAGCCGCCACTCAAGGCTATACAATGTTGCCAACTCTCCGGGCTT 
i i-. . ■ ■ i » ■ ■ ■ ■ i ' ■ ■ ■ l /J50 

TATCCTGTGCTCCCGAATACCACATCGTGATGATGCTTCAGCGCACGGAA 
■ i — i ■ ■ ■ ' • 1 ■ — «- 500 

GTCACAGACACCGCCTGTATAAAAGGGGGACTGTGACCCTGTATGAGGCG 
i ■ • ' » -J- 550 

CAACATGGTCTCACAGCAGCTCACCTGAAGAGGCTTGTAAGATCACCCTC 
i i ■ ■ i ■ ■ i ■ ■ i i » ■ ■ i ' 600 

TGTGTATTGCACCATGATTGTCGGCATTCTCACCACGCTGGCTACGCTGG 
, 1 , i ■ ' » L 650 

Met He Val Gly He Leu Thr Thr Leu Ala Thr Leu 

CCACACTCGCAGCTAGTGTGCCTCTAGAGGAGCGGCAAGCTTGCTCAAGC 
i ■ > . ■ i ■ ■ ' ■ ■ ' ■ — 1 700 

Ala Thr Leu Ala Ala Ser Val Pro Leu Glu Glu Arg Gin Ala Cys Ser Ser 

GTCTGGTAATTATGTGAACCCTCTCAAGAGACCCAAATACTGAGATATGT 

, i ■ i 1 1 ' ' L 750 

Val Trp— ~" 

CAAGGGGCCAATGTGGTGGCCAGAATTGGTCGGGTCCGACTTGCTGTGCT 

• ■ ■ ■ 1 * iii ■ * — ■ ■ ■ i BOO 



Gly Gin Cys Gly Gly Gin Asn Trp Ser Gly Pro Thr Cys Cys Ala 



TCCGGAAGCACATGCGTCTACTCCAACGACTATTACTCCCAGTGTCTTCC 

- i -j ' 1 — 1 850 

Ser Gly Ser Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gin Cys Leu Pro 
CGGCGCTGCAAGCTCAAGCTCGTCCACGCGCGCCGCGTCGACGACTTCTC 

• 1 1 ■ ' 1 1 ■ — 1 L 900 

Gly Ala Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser 
GAGTATCCCCCACAACATCCCGGTCGAGCTCCGCGACGCCTCCACCTGGT 
- . 1 » — 1 ' 950 

Arg Val Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly 
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TCTACTACTACCAGAGTACCTCCAGTCGGATCGGGAACCGCTACGTATTC 
. . i i . 1 , ■ ' ■ 1000 

Ser Thr Thr Thr Arg Vol Proj£o Vol Gly Ser Gly Thr Ala Thr Tyr Ser 
AGGCAACCCTTTTGTTGGGGTCACTCCTTGGGCCAATGCATATTACGCCT 



■ - ■ ■ 



1050 



Gly Asn Pro Phe Val Gly Vol Thr Pro Trp Ala Asn Ala Tyr Tyr Ala 

CTGAAGTTAGCAGCCTCGCTATTCCTAGCTTGACTGGAGCCATGGCCACT 

■ i ... ■ i ........ i . .... i . ■ 

Ser Glu Val Ser Ser Leu Ala lie Pro Ser Leu Thr Gly Ala Met Ala Thr 
GCTGCAGCAGCTGTCGCAAAGGTTCCCTCTTTTATGTGGCTGTAGGTCCT 



1100 



1150 



Ala Ala Ala Ala Val Ala Lys Vol Pro Ser Phe Met Trp Le u 
CCCGGAACCAAGGCAATCTGTTACTGAAGGCTCATCATTCACTGCAGAGA 

• i - ... i i . . < , , i . .... i 

" Asp 

TACTCTTGACAAGACCCCTCTCATGGAGCAAACCTTGGCCGACATCCGCA 

* i ......... i i ..... . i 

Thr Leu Asp Lys Thr Pro Leu Met Glu Gin Thr Leu Ala Asp He Arg 

CCGCCAACAAGAATGGCGGTAACTATGCCGGACAGTTTGTGGTGATAGAC 

■ ■■».-■■ i . . . i . ....... . i ........ i . . . 

Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gin Phe Val Val He Asp 
TTGCCGGATCGCGATTGCGCTGCCCTTGCCTCGAATGGCGAATACTCTAT 

■ i i ■ I ■ ■ I .i I .... A .... I 

Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr Ser He 
TGCCGATGGTGGCGTCGCCAAATATAAGAACTATATCGACACCATTCGTC 

Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr lie Asp Thr He Arg 
AAATTGTCGTGGAATATTCCGATATCCGGACCCTCCTGGTTATTGGTATG 



1200 



1250 



1300 



1350 



1100 



1150 



Gin He Val Val Glu Tyr Ser Asp He Arg Thr Leu Leu Val He 

AGTTTAAACACCTGCCTCCCCCGCCCCTTCCCTTCCTTTCCCGCCGGCAT 

■ ■ • - . . . ■ ... . ■ - i . ■ 



CTTGTCGTTGTGCTAACTATTGTTCCCTCTTCCAGAGCCTGACTCTCTTG 

' ■ ■ > t ..... . . t ......... i ........ . i 



1500 



1550 



"Glu Pro Asp Ser Leu 



CCAACCTGGTGACCAACCTCGGTACTCCAAAGTGTGCCAATGCTCAGTCA 

■ ■ < 1 1 ' 1 1 • 1600 



Ala Asn Leu Val Thr Asn Leu Gly Thr Pro Lys Cys Ala Asn Ala Gin Ser 
GCCTACCTTGAGTGCATCAACTACGCCGTCACACAGCTGAACCTTCCAAA 



1650 



Ala Tyr Leu Glu Cys He Asn Tyr Ala Val Thr Gin Leu Asn Leu Pro Asn 

TGTTGCGATGTATTTGGACGCTGGCCATGCAGGATGGCTTGGCTGGCCGG 
, 1 , 1 , , , 1 , l. j 

Val Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro 
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CAAACCAAGACCCGGCCGCTCAGCTATTTGCAAATGTTTACAAGAATGCA 
, , , , , , , , , l 175 q 

Ala Asn Gin Asp Pro Ala Ala Gin Leu Phe Ala Asn Val Tyr Lys Asn Ala 

TCGTCTCCGAGAGCTCTTCGCGGATTGGCAACCAATGTCGCCAACTACAA 
, , , 1 , , , , , l 18 00 

Ser Ser Pro Arg Ala Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn 

CGGGTGGAACATTACCAGCCCCCCATCGTACACGCAAGGCAACGCTGTCT 
, 1 . . . . . . , , i , , 185 q 

Gly Trp Asn lie Thr Ser Pro Pro Ser Tyr Thr Gin Gly Asn Ala Val 

ACAACGAGAAGCTGTACATCCACGCTATTGGACCTCTTCTTGCCAATCAC 
1 1 . 1 . 1 , 1 i j 900 

Tyr Asn Glu Lys Leu Tyr He His Ala lie Gly Pro Leu Leu Ala Asn His 

GGCTGGTCCAACGCCTTCTTCATCACTGATCAAGGTCGATCGGGAAAGCA 
. 1 . 1 . 1 . ■ ■ 1950 

Gly Trp Ser Asn Ala Phe Phe lie Thr Asp Gin Gly Arg Ser Gly Lys Gin 

GCCTACCGGACAGCAACAGTGGGGAGACTGGTGCAATGTGATCGGCACCG 

1 1 ■ — 1 1 1 — 1 — ■ ' L 2000 

Pro Thr Gly Gin Gin Gin Trp Gly Asp Trp Cys Asn Va! He Gly Thr 

GATTTGGTATTCGCCCATCCGCAAACACTGGGGACTCGTTGCTGGATTCG 

- ' ' 1 ' ' ■ 1 ■ 2050 

Gly Phe Gly lie Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser 

TTTGTCTGGGTCAAGCCAGGCGGCGAGTGTGACGGCACCAGCGACAGCAG 
. 1 . 1 . 1 . 1 , l 2100 

Phe Val Trp Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser 
TGCGCCACGATTTGACTCCCACTGTGCGCTCCCAGATGCCTTGCAACCGG 



2150 



Ala Pro Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gin Pro 
CGCCTCAAGCTGGTGCTTGGTTCCAAGCCTACTTTGTGCAGCTTCTCACA 

Ala Pro Gin Ala Gly Ala Trp Phe Gin Ala Tyr Phe Val Gin Leu Leu Thr 

AACGCAAACCCATCGTTCCTGTAAGGCTTTCGTGACCGGGCTTCAAACAA 

- - - - » - - i . . . . » i . . ... .1 . . i ...... t 

Asn Ala Asn Pro Ser Phe Leu • 

TGATGTGCGATGGTGTGGTTCCCGGTTGGCGGAGTCTTTGTCTACTTTGG 

> • i ........ i . . i . i 

TTGTCTGTCGCAGGTCGGTAGACCGCAAATGAGCAACTGATGGATTGTTG 
CCAGCGATACTATAATTCACATGGATGGTCTTTGCGATCAGTAGCTAGTG 

» i i ■ i . i . .... i 

AGAGAGAGAGAACATCTATCCACAATGTCGAGTGTCTATTAGACATACTC 

CGAGAATAAAGTCAACTGTGTCTGTGATCTAAAGATCGATTCGGCAGTCG 

AGTAGCGTATAACAACTCCGAGTACCAGCAAAAGCACGTCGTGACAGGAG 

CAGGCTTTGCCAACTGCGCAACCTTGCTTGAATGAGGATACACGGGGTGC 
, i . — ..i 1 . i i i . . i . i i 



2200 



2250 



2300 
2350 
2100 
2150 
2500 
2550 
2600 
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AACATGGCTGTACTGATCCATCGCAACCAAAATTTCTGTTTATAGATCAA 

■ ■■■»■■-. i .... ■ ... i — . . * . . i . ... i . | 26S0 

GCTGGTAGATTCCAATTACTCCACCTCTTGCGCTTCTCCATGACATGTAA 
, 1 1 i— , , ■ , , l 2700 

GTGCACGTAGGAAACCATACCCAAATTGCCTACAGCTGCGGAGCATGAGC 
, 1 — ' ' ■ " ■ L 2750 

CTATGGCGATCAGTCTGGTCATGTTAACCAGCCTGTGCTCTGACGTTAAT 
, i , , , , , , , L 2800 

GCAGAATAGAAAGCCGCGGTTGCAATGCAAATGATGATGCCTTTGCAGAA 
. , , , , , , , , l 2850 

ATGGCTTGCTCGCTGACTGATACCAGTAACAACTTTGCTTGGCCGTCTAG 
. 1 1 1 1 ■ 2900 

CGCTGTTGATTGTATTCATCACAACCTCGTCTCCCTCCTTTGGGTTGAGC 
, ' 1 > ' • ' 1 ■ l 2950 

TCTTTGGATGGCTTTCCAAACGTTAATAGCGCGTTTTTCTCCACAAAGTA 
. 1 , • ' ' 3000 

TTCGTATGGACGCGCTTTTGGCTGTATTGCGTGAGCTACCAGCAGCCCAA 
■ 1 . 1 , , , 1 , ! , l 3050 

TTGGCGAAGTCTTGAGCCGCACTCGCATAGAATAATTGATTGCGCATTTG 
. 1 ' ■ ■ «- 3100 

ATGCGATTTTTGAGCGGCTGTTTCAGGCGACATTTCGCCGCCTTTATTTG 
, , , 1 — , , , , l 3150 

CTCCATTATATCATCGATGGCATGTCCAATAGCCCGGTGATAGTCTTGTC 

— i 1 i i 3200 

GAA7ATGGCTGTCGTGGATAACCCATCGGCAGCAGATGATAATGATTCCG 
, ■ • ' ■ 1 ' 3250 

CAGCACAAGCTCGTATGTGGGTAGCAGAAGAACTGAGCGAGATCTTCGAG 
. 1 . 1 , 1 , ' 3300 

GGCGTAACTCTGCATATCCGATTGGCCTGCTGCCACATGTCATTTTGCTT 
, , , , , ■ ' . l 3350 

CGGTTTCTTTTCTGTTGAGTTCTTGTATTTGGGTGAAAGTAACATGGTGT 
. 1 ' 1 — - — ' ' ■ l 3x400 

ATGACGAGAGACATTGGTGGTAAGAAAAAATTTCACCTCCTCTTAGTGCA 
, 1 . 1 . 1 . 1 . l 3/J50 

GGACTGACTCTCAAAATCTATATGCAAATGTGTCGTGTAACACCCTTCGC 
L- ' ■ 1 ■ 1 ■ 3500 

ATGAGCGCTGACCGTACCCTACCATTTCGCCCCACTCATGATAGCAGAAG 
, , , 1 , , , ' 3550 

AGACATATTAATTCGGCAATGCTACGAAAGTCTGCAGGCTATGCTTAAAT 
, , 1 . 1 . 1 1 l 36OO 

AAACGCTTGCCACAGAAGCCGACAGTTTATTGTTACTACTTACTATACTG 
1 1 < 1 1 1 ■ 1 1 3650 

TATTATTGTTGCTCACATAAGGCGGTGAACCATTGGTTCACACGACGCCT 
■ 1 ■ > ■ ' ■ ' ■ «- 3700. 

GACGAGGTAAATTACTCTCTCGTAGGGCTGCCAAGGTAGGTCCCAACCCC 
, i , -1— — — _i_ , 1 , l 3 7 5Q 

GTATCCTCGGTCGAGGGTGCGAGGTTCTTTGGTCCTTCCCTCTTTGGTAA 
. 1 ■ 1 . 1 ■ l 3800 
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AGCCCAGTAGCGTGTTTGAATCAGTTCACAATCTCTCCTAAACACAGTCC 

■ ■ i . 1 ' — ■ " « *■ 3850 

GACACTAGGTAGGTACGTTGTAATAGCAACTCAAACATGTAATTCGTTTC 

■ ' ■ ' 3900 

AAGGCAGGAACATTTTATAAACTTCCCTGCGATTTAATCAATAAAGATCC 

i ........ i i i . . . i ■ i . ■ . i 3950 

TAGTCCAATCGTATACTACCTACCTAGCTAAGGTAGGTAGGTAGTTCGTG 
i i ■ i ■ ■ 1 ' 1- 4000 

GGAACCTGGTCGCTAATTCACGCAACCCACTTTGCGCTCTTCGCCTGGCC 



4050 



GTCGTTGAAGGTAAAGCAGTTGTACCCATCACCTAACTCAACCGACACCG 



TTGATCTGCTCAAGGCAGTTTTC 
, ■ . 1 » i|i23 



FIG..2E 



FIG. _2 



a 100 
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TGTGTTGAAATCCAACTTATAAAGACAACAACCGCAAACTTTGTCTTGTG 
I 1 1 1 1 1 ' 1 ' 50 

CCATCAGATTGTTGCCAAGCACCGTCCCCCCCCCCTATCTTAGTCCTTCT 

* . . . . i ...... i i ... i . i 



100 
150 



200 



250 



300 



350 



100 



150 



TGTTGTCCCAAAATGGCGCCCTCAGTTACACTGCCGTTGACCACGGCCAT 

Met Ala Pro Ser Vol Thr Leu Pro Leu Thr Thr Ala lie 
CCTGGCCATTGCCCGGCTCGTCGCCGCCCAGCAACCGGGTACCAGCACCC 

Leu Ala He Ala Arg Leu Vol Ala Ala Gin Gin Pro Gly Thr Ser Thr 

CCGAGGTCCATCCCAAGTTGACAACCTACAAGTGTACAAAGTCCGGGGGG 

■ * * .... . . i ...... i . ... i ......... i 

Pro Glu Val His Pro Lys Leu Thr Thr Tyr Lys Cys Thr Lys Ser Gly Gly 
TGCGTGGCCCAGGACACCTCGGTGGTCCTTGACTGGAACTACCGCTGGAT 

■ • • ■ I ■ ' I i .... A .... I ...... I 

Cys Val Ala Gin Asp Thr Ser Vol Val Leu Asp Trp Asn Tyr Arg Trp Met 

GCACGACGCAAACTACAACTCGTGCACCGTCAACGGCGGCGTCAACACCA 

« - > - ■ - » - . . . i ... . .... i ..... ■ . i 

His Asp Ala Asn Tyr Asn Ser Cys Thr Val Asn Gly Gly Val Asn Thr 

CGCTCTGCCCTGACGAGGCGACCTGTGGCAAGAACTGCTTCATCGAGGGC 

» . . i ..... .... i _ .... ... i .... ... i . . i 

Thr Leu Cys Pro Asp Glu Ala Thr Cys Gly Lys Asn Cys Phe lie Glu Gly 

GTCGACTACGCCGCCTCGGGCGTCACGACCTCGGGCAGCAGCCTCACCAT 
i .... i .... i i i i 

Val Asp Tyr Ala Ala Ser Gly Val Thr Thr Ser Gly Ser Ser Leu Thr Met 
GAACCAGTACATGCCCAGCAGCTCTGGCGGCTACAGCAGCGTCTCTCCTC 

Asn Gin Tyr Met Pro Ser Ser Ser Gly Gly Tyr Ser Ser Val Ser Pro 

GGCTGTATCTCCTGGACTCTGACGGTGAGTACGTGATGCTGAAGCTCAAC 

■ ■ i i ... i i i i i i i t .... i i 

Arg Leu Tyr Leu Leu Asp Ser Asp Gly Glu Tyr Val Met Leu Lys Leu Asn 

GGCCAGGAGCTGAGCTTCGACGTCGACCTCTCTGCTCTGCCGTGTGGAGA 
i i .... i .... i i . , i , i 

Gly Gin Glu Leu Ser Phe Asp Val Asp Leu Ser Ala Leu Pro Cys Gly Glu 

GAACGGCTCGCTCTACCTGTCTCAGATGGACGAGAACGGGGGCGCCAACC 

■ ■ . . i .... i .... ■ . . 

Asn Gly Ser Leu Tyr Leu Ser Gin Met Asp Glu Asn Gly Gly Ala Asn 

AGTATAACACGGCCGGTGCCAACTACGGGAGCGGCTACTGCGATGCTCAG 

i . i . < . i . i 

Gin Tyr Asn Thr Ala Gly Ala Asn Tyr Gly Ser Gly Tyr Cys Asp Ala Gin 

TGCCCCGTCCAGACATGGAGGAACGGCACCCTCAACACTAGCCACCAGGG 
, , , , 1 . 1 , l 750 

Cys Pro Val Gin Thr Trp Arg Asn Gly Thr Leu Asn Thr Ser His Gin Gly 

CTTCTGCTGCAACGAGATGGATATCCTGGAGGGCAACTCGAGGGCGAATG 
■ ' 1 ■ ' < >- 800 

Phe Cys Cys Asn Glu Met Asp He Leu Glu Gly Asn Ser Arg Ala Asn 



500 



550 



600 



650 



700 
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CCTTGACCCCTCACTCTTGCACGGCCACGGCCTGCGACTCTGCCGGTTGC 

■ - ■ ■ ■ i ...... I .. i ...... . i 

Ala Leu Thr Pro His Ser Cys Thr Ala Thr Ala Cys Asp Ser Ala Gly Cys 
GGCTTCAACCCCTATGGCAGCGGCTACAAAAGGTGAGCCTGATGCCACTA 

- - - . . . . . I . . _ _ . . . . . I A ......... 1 . ....... I 



650 



900 



Gly Phe Asn Pro Tyr Gly Ser Gly Tyr Lys Se i 

CTACCCCTTTCCTGGCGCTCTCGCGGTTTTCCATGCTGACATGGTTTTCC 



950 



1000 



1050 



1100 



1150 



1200 



1250 



AGCTACTACGGCCCCGGAGATACCGTTGACACCTCCAAGACCTTCACCAT 

- - - - i ■ » . i ... i . . i 

— Tyr Tyr Gly Pro Gly Asp Thr Val Asp Thr Ser Lys Thr Phe Thr He 
CATCACCCAGTTCAACACGGACAACGGCTCGCCCTCGGGCAACCTTGTGA 

He Thr Gin Phe Asn Thr Asp Asn Gly Ser Pro Ser Gly Asn Leu Val 

GCATCACCCGCAAGTACCAGCAAAACGGCGTCGACATCCCCAGCGCCCAG 

i i i i i i i i i i i i i i i i ■■■ i i i i 

Ser lie Thr Arg Lys Tyr Gin Gin Asn Gly Val Asp He Pro Ser Ala Gin 

CCCGGCGGCGACACCATCTCGTCCTGCCCGTCCGCCTCAGCCTACGGCGG 

* . ... i i t ........ i .... i 

Pro Gly Gly Asp Thr He Ser Ser Cys Pro Ser Ala Ser Ala Tyr Gly Gly 

CCTCGCCACCATGGGCAAGGCCCTGAGCAGCGGCATGGTGCTCGTGTTCA 
.... i ■ . i .... i .... i i i i 

Leu Ala Thr Met Gly Lys Ala Leu Ser Ser Gly Met Val Leu Val Phe 

GCATTTGGAACGACAACAGCCAGTACATGAACTGGCTCGACAGCGGCAAC 

. - - - » - ■ ... i ..... i . t ......... % 

Ser He Trp Asn Asp Asn Ser Gin Tyr Met Asn Trp Leu Asp Ser Gly Asn 

GCCGGCCCCTGCAGCAGCACCGAGGGCAACCCATCCAACATCCTGGCCAA 

. . . . . . . i . t i ...... i .... i 

Ala Gly Pro Cys Ser Ser Thr Glu Gly Asn Pro Ser Asn He Leu Ala Asn 

CAACCCCAACACGCACGTCGTCTTCTCCAACATCCGCTGGGGAGACATTG 

. . . » - . i . i ...... t ..... i 

Asn Pro Asn Thr His Val Val Phe Ser Asn He Arg Trp Gly Asp He 
GGTCTACTACGAACTCGACTGCGCCCCCGCCCCCGCCTGCGTCCAGCACG 

• ' ...... I A .... I I ■■■■»■■■■ I 

Gly Ser Thr Thr Asn Ser Thr Ala Pro Pro Pro Pro Pta Ala Ser Ser Thr 

ACGTTTTCGACTACACCGAGGAGCTCGACGACTTCGAGCAGCCCGAGCTG 
, , — i ■ . — ■ « ■ 1 — L 1450 

Thr Phe Ser Thr Thr Pro Arg Ser Ser Thr Thr Ser Ser Ser Pro Ser Cys 

CACGCAGACTCACTGGGGGCAGTGCGGTGGCATTGGGTACAGCGGGTGCA 
, ^ , 1 . 1 . 1 * l j 500 

Thr Gin Thr His Trp Gly Gin Cys Gly Gly He Gly Tyr Ser Gly Cys 

AGACGTGCACGTCGGGCACTACGTGCCAGTATAGCAACGACTGTTCGTAT 
, i > 1 ■ ■ • l 1550 



1300 



1350 



1400 



Lys Thr Cys Thr Ser Gly Thr Thr Cys Gin Tyr Ser Asn Asp 
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CCCCATGCCTGACGGGAGTGATTTTGAGATGCTAACCGCTAAAATACAGA 

■ ' ■ 1 > — — 1 — 1 — L 1600 



: ^ Tyr 

CTACTCGCAATGCCTTTAGAGCGTTGACTTGCCTCTGGTCTGTCCAGACG 



Tyr Ser Gin Cys Leu • 
GGGGCACGATAGAATGCGGGCACGCAGGGA 



*• 1680 
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1650 



FIG.-3A 
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TGCCATTTCTGACCTGGATAGGTTTTCCTATGGTCATTCCTATAAGAGAC 

| ... I I 1 1 , I . 1 , I gQ 

ACGCTCTTTCGTCGGCCCGTAGATATCAGATTGGTATTCAGTCGCACAGA 

1 • 1 i 1 ■ 1 i ■ \QQ 

CGAAGGTGAGTTGATCCTCCAACATGAGTTCTATGAGCCCCCCCCTTGCC 

150 
200 
250 
300 



i - - ■ - » - - i • ■ 



350 



400 



CCCCCCCGTTCACCTTGACCTGCAATGAGAATCCCACCTTTTACAAGAGC 

i i i i I i » - - | - - - « - - - - « | r t - » - - « ... f < .... | ill 

ATCAAGAAGTATTAATGGCGCTGAATAGCCTCTGCTCGATAATATCTCCC 

1 • ■ ■ ■ i ■ - ' 

CGTCATCGACAATGAACAAGTCCGTGGCTCCATTGCTGCTTGCAGCGTCC 

- ■ ■ ■ ■ ■ ■ 1 * ' - - ■ i .... - ■ 

Met Asn Lys Ser Vol Ala Pro Leu Leu Leu Ala Ala Ser 
ATACTATATGGCGGCGCCGTCGCACAGCAGACTGTCTGGGGCCAGTGTGG 

- 1f[ . ■ ... I .... 1 ... ■ | I .... . . . . _ i . - ....... I . ... ^ . . - -I 

Fie Leu Tyr Gly Gly Ala Val Ala Gin Gin Thr Val Trp Gly Gin Cys Gly 
AGGTATTGGTTGGAGCGGACCTACGAATTGTGCTCCTGGCTCAGCTTGTT 

. t ........ t ...... i ■ » 

Gly He Gly Trp Ser Gly Pro Thr Asn Cys Ala Pro Gly Ser Ala Cys 

CGACCCTCAATCCTTATTATGCGCAATGTATTCCGGGAGCCACTACTATC 

1 i - - 1 --------- » - ■ . _ i 

Ser Thr Leu Asn Pro Tyr Tyr Alo Gin Cys lie Pro Gly Ala Thr Thr He 

ACCACTTCGACCCGGCCACCATCCGGTCCAACCACCACCACCAGGGCTAC 
, ' ■ ' ' ' — ' *■ 500 

Thr Thr Ser Thr Arg Pro Pro Ser Gly Pro Thr Thr Thr Thr Arg Ala Thr 

CTCAACAAGCTCATCAACTCCACCCACGAGCTCTGGGGTCCGATTTGCCG 
. 1 . 1 . 1 . 1 550 

Ser Thr Ser Ser Ser Thr Pro Pro Thr Ser Ser Gly Val Arg Phe Ala 

GCGTTAACATCGCGGGTTTTGACTTTGGCTGTACCACAGAGTGAGTACCC 
, , , , , , , , ■ . ■ i 600 



450 



Gly Val Asn He Ala Gly Phe Asp Phe Gly Cys Thr Thr Asp 

TTGTTTCCTGGTGTTGCTGGCTGGTTGGGCGGGTATACAGCGAAGCGGAC 



650 



GCAAGAACACCGCCGGTCCGCCACCATCAAGATGTGGGTGGTAAGCGGCG 

' ... i i . . ■»■.■..■■■■■ 



GTGTTTTGTACAACTACCTGACAGCTCACTCAGGAAATGAGAATTAATGG 

. . . . ■ i.i- . .... l .... * . t ......... i 



700 



750 



AAGTCTTGTTACAGTGGCACTTGCGTTACCTCGAAGGTTTATCCTCCGTT 
, _j . 1 . 1 > 1 , l 800 



Gly Thr Cys Val Thr Ser Lys Val Tyr Pro Pro Leu 



GAAGAACTTCACCGGCTCAAACAACTACCCCGATGGCATCGGCCAGATGC 
1 i . ■ i ■ 1 ■ " ■ l 850 

Lys Asn Phe Thr Gly Ser Asn Asn Tyr Pro Asp Gly He Gly Gin Met 
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AGCACTTCGTCAACGAGGACGGGATGACTATTTTCCGCTTACCTGTCGGA 

1 1 ' 1 1 » ■ ■ . ■ i 900 

Gin His Phe Vol Asn Glu Asp Gly Met Thr He Phe Arg Leu Pro Vol Gly 

TGGCAGTACCTCGTCAACAACAATTTGGGCGGCAATCTTGATTCCACGAG 
■ 1 ■ ■ 950 

Trp Gin Tyr Leu Vol Asn Asn Asn Leu Gly Gly Asn Leu Asp Ser Thr Ser 

CATTTCCAAGTATGATCAGCTTGTTCAGGGGTGCCTGTCTCTGGGCGCAT 

■ i , ... i ...... t ... . 

He Ser Lys Tyr Asp Gin Leu Vol Gin Gly Cys Leu Ser Leu Gly Ala 

ACTGCATCGTCGACATCCACAATTATGCTCGATGGAACGGTGGGATCATT 

, , , , i i ...... 

Tyr Cys He Vol Asp He His Asn Tyr Ala Arg Trp Asn Gly Gly He lie 
GGTCAGGGCGGCCCTACTAATGCTCAATTCACGAGCCTTTGGTCGCAGTT 

■ ■ m 



1000 



1050 



1100 



Gly Gin Gly Gly Pro Thr Asn Ala Gin Phe Thr Ser Leu Trp Ser Gin Leu 

GGCATCAAAGTACGCATCTCAGTCGAGGGTGTGGTTCGGCATCATGAATG 
— — 1 . 1 ■ « 1 l j 

Ala Ser Lys Tyr Ala Ser Gin Ser Arg Val Trp Phe Gly He Met Asn 

AGCCCCACGACGTGAACATCAACACCTGGGCTGCCACGGTCCAAGAGGTT 
1 1 ' ' ' 1 ■ ■ ■ ■ ' 1200 

Glu Pro His Asp Val Asn lie Asn Thr Trp Ala Ala Thr Val Gin Glu Val 

GTAACCGCAATCCGCAACGCTGGTGCTACGTCGCAATTCATCTCTTTGCC 

' — ' ' 1 ' 1 ■ ' ■ 1250 

Val Thr Ala He Arg Asn Ala Gly Ala Thr Ser Gin Phe lie Ser Leu Pro 

TGGAAATGATTGGCAATCTGCTGGGGCTTTCATATCCGATGGCAGTGCAG 

Gly Asn Asp Trp Gin Ser Ala Gly Ala Phe He Ser Asp Gly Ser Ala 
CCGCCCTGTCTCAAGTCACGAACCCGGATGGGTCAACAACGAATCTGATT 

i . . . . i . i , . 

Alo Ala Leu Ser Gin Val Thr Asn Pro Asp Gly Ser Thr Thr Asn Leu He 

TTTGACGTGCACAAATACTTGGACTCAGACAACTCCGGTACTCACGCCGA 

■ ■ 1 - ■ - ■ ... ■ . ■ 

Phe Asp Val His Lys Tyr Leu Asp Ser Asp Asn Ser Gly Thr His Ala Glu 
ATGTACTACAAATAACATTGACGGCGCCTTTTCTCCGCTTGCCACTTGGC 



1300 



1350 



1400 



Cys Thr Thr Asn Asn lie Asp Gly Ala Phe Ser Pro Leu Ala Thr Trp 
TCCGACAGAACAATCGCCAGGCTATCCTGACAGAAACCGGTGGTGGCAAC 

Leu Arg Gin Asn Asn Arg Gin Ala He Leu Thr Glu Thr Gly Gly Gly Asn 
GTTCAGTCCTGCATACAAGACATGTGCCAGCAAATCCAATATCTCAACCA 

Val Gin Ser Cys lie Gin Asp Met Cys Gin Gin He Gin Tyr Leu Asn Gin 
GAACTCAGATGTCTATCTTGGCTATGTTGGTTGGGGTGCCGGATCATTTG 

... I .... 1 ■ . . . 1 - A . _ ■ ■ 1 ■ . . , ... | 

Asn Ser Asp Vol Tyr Leu Gly Tyr Val Gly Trp Gly Ala Gly Ser Phe 



1450 



1500 
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1600 
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ATAGCACGTATGTCCTGACGGAAACACCGACTAGCAGTGGTAACTCATG6 

. . - . • I ■ *•••'* - i j r ,. ..... i 

Asp Ser Thr Tyr Vol Leu Thr Glu Thr Pro Thr Ser Ser Gly Asn Ser Trp 

ACGGACACATCCTTGGTCAGCTCGTGTCTCGCAAGAAAGTAGCACTCTGA 

- > ' i . , . . i . . ...... i i 

Thr Asp Thr Ser Leu Val Ser Ser Cys Leu Ala Arg Lys • 
GCTGAATGCAGAAGCCTCGCCAACGTTTGTATCTCGCTATCAAACATAGT 

■ ■ ■ ■ ■ 1 i 

AGCTACTCTATGAGGCTGTCTGTTCTCGATTTCAGCTTTATATAGTTTCA 

■ i i i ■ i I i — ' 

TCAAACAGTACATATTCCCTCTGTGGCCACGCAAAAAAAAAAAAAAAAA 



1650 



1700 
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1800 
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GGGTGGTCTGGATGAAACGTCT TGGCCAAATCGTGATCGATTGATACTCG 

| - - ■ - i I .... i . - - - I - .... I I gQ 

CATCTATAAGATGGCACAGATCGACTCTTGATTCACAGACATCCGTCAGC 

CCTCAAGCCGTTTGCAAGTCCACAAACACAAGCACAAGCATAGCGTCGCA 
, , , , , ! , , , , , , t 15 q 

ATGAAGTTCCTTCAAGTCCTCCCTGCCCTCATACCGGCCGCCCTGGCCCA 
i i ■ i i i i i 200 

Met Lys Phe Leu Gin Vol Leu Pro Ala Leu He Pro Ala Ala Leu Ala Gin 

t 

AACCAGCTGTGACCAGTGGGCAACCTTCACTGGCAACGGCTACACAGTCA 

i i .... i ... ■ i .... i .... i i 250 

Thr Ser Cys Asp Gin Trp Ala Thr Phe Thr Gly Asn Gly Tyr Thr Vol 

GCAACAACCTTTGGGGAGCATCAGCCGGCTCTGGATTTGGCTGCGTGACG 
j , ' »- 300 

Ser Asn Asn Leu Trp Gly Ala Ser Ala Gly Ser Gly Phe Gly Cys Val Thr 

GCGGTATCGCTCAGCGGCGGGGCCTCCTGGCACGCAGACTGGCAGTGGTC 

■ ■ ■ ■ ■ ■ ■ ■ ' ' — ■ —J — ' ■ — «- 350 

Ala Val Ser Leu Ser Gly Gly Ala Ser Trp His Ala Asp Trp Gin Trp Ser 
CGGCGGCCAGAACAACGTCAAGTCGTACCAGAACTCTCAGATTGCCATTC 



i i .... i .... i .... i .... i 



400 



Gly Gly Gin Asn Asn Val Lys Ser Tyr Gin Asn Ser Gin He Ala lie 
CCCAGAAGAGGACCGTCAACAGCATCAGCAGCATGCCCACCACTGCCAGC 

> * , i I , ' ■ - 1 - - - • 1 » - - ' 

Pro Gin Lys Arg Thr Val Asn Ser He Ser Ser Met Pro Thr Thr Alo Ser 
TGGAGCTACAGCGGGAGCAACATCCGCGCTAATGTTGCGTATGACTTGTT 

Trp Ser Tyr Ser Gly Ser Asn lie Arg Ala Asn Val Ala Tyr Asp Leu Phe 



450 



500 



CACCGCAGCCAACCCGAATCATGTCACGT ACTCGGGAGACTACGAACTCA 
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TGATCTGGTAAGCCATAAGAAGTGACCCTCCTTGATAGTTTCGACTAACA 
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Met lie Trp- 
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Tyr Asn Gly Ala Met Gin Val Tyr Ser Phe Val Ala Gin Thr Asn Thr Thr 
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Bsml 
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Scal/Smal 
EcoRI 




• Digest with Bsml and EcoRI 

• Isolate 300bp Bsml/EcoRI Fragment 
•Digest pUC218 with SstI and EcoRI 
•Ligate pUC21 8 Sstl/EcoRl and Baml/EcoRI 

fragment with the following synthetic oligonucleotides (SEQ. ID NO:37) 

CGTAOAGCGTTGACTTGCCTGTGGTCTGTCCAGACGGGGGACGATAGAATGCG 
TCGAGCATCTCGCAACTGAACGGACACCAQACAQGTCTGCCCCCTGCTATCTTAC 

SstI , Bsml 



I 



Bsml 
SstI 
HindlE 




SstI 
Scal/Smal 
EcoRI 



f 

• Digest pEG1T with Hindlll and Bsml and Isolate vector fragment 

• Digest pUC218::EG1 with Hindm and SstI and Isolate 2.3 kb EG1 fragment 
•Ligate pEGlT Hindlll/BsmI and 2.3 Kb Hindlll/SstI with the 
following synthetic oligonucleotides 

CGTAGAGCGTTGACTTGCCTGTGGTCTGTCCAGACGGGGGACGATAGAATGCG 
TCGAGCATCTCGCAACTGAACGGACACCAGACAGGTCTGCCCCCTGCTATCTTAC 

SstI 8sml 
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Hindm 
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Linker replaces DNA 
between Sstl/BsmI sites 
Scal/Smal 



EcoRl 




EcoRI 



• Digest p219M with EcoRI and Hindm 

• Isolate 1 .6Kb EcoRl/Hindm pyr4 gene fragment 

• Digest pUC218 with EcoRI SstI and dephosphorylate the 
ends with calf alkaline phosphotase 

• Isolate the Hindm/EcoRI EG1 fragment from pEG1 A3' 

• Ligate together pUC18 EcoRI, EcoRI/Hindm pyr4 gene 
fragment and Hindin/EcoRI EG1 fragment 



t 



EcoRI 




pUC18 JT^ Hindm 



EcoRI 
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Nhel 



Bgin 
Xbal 

Hindm 



• Partial digest with Xbal. Isolate Linear vector fragment. Fill in the overhangs 
with T4 DNA polymerase and dNTPs and ligate the blunt ends together. 
This has the effect of destroying the filled in site. 

• Select a plasmid in which the Xbal site in the polylinker has been destroyed. 

• Digest the plasmid with the polylinker Xbal site destroyed with Xbal and SnaBI. 

• Isolate the vector fragment and ligate with the following synthetic oligonucleotides 

5'CTA GAG GAG CGG TCG GGA ACC GCT AC 3' 
3' TC CTC GCC AGC CCT TGG CGA TG 5' 

Xbal SnaBI 

• This effectively replaces the CBD with a linker that joins the signal peptidase 
signal with the papain cleavage point in the linker domain of the CBHII protein 



Xbal 



SnaBI 




CBHI 
terminator 



EcoRI 



Digest pUCACBD CBHII with Nhel 
Blunt the Nhel overhangs using T4 
DNA polymerase and dNTPs 
Digest with Bglll and isolate the Nhel 
(blunt) - Bglll cbh2 core domain fragment 

Ligate together with the synthetic 
oligonucleotide cgctag to link the Bglll 
overhang with t he Sstll overhang 

cbhl Terminator f 




Nhel 
NotI 



CBHI 
terminator 



NotI 
PstI 



CBHI 
Sstn ' Promoter 

Digest with Sstll and Pmel and 
isolate the vector fragment 



NotI 



PUC1180 
NotI 



FIG..9 



cbhl 

terminator 




pTEXCBHIlcore | cbhJ 

promoter 



Nhel(blunt)/Pmel ^^^^ Bglll/SstH 
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